Warning: Permanently added '98.92.214.254' (ED25519) to the list of known hosts. You can reproduce this build on your computer by running: sudo dnf install copr-rpmbuild /usr/bin/copr-rpmbuild --verbose --drop-resultdir --task-url https://copr.fedorainfracloud.org/backend/get-build-task/9938613-rhel+epel-10-x86_64 --chroot rhel+epel-10-x86_64 Version: 1.6 PID: 8819 Logging PID: 8821 Task: {'allow_user_ssh': False, 'appstream': False, 'background': True, 'bootstrap': 'off', 'build_id': 9938613, 'buildroot_pkgs': [], 'chroot': 'rhel+epel-10-x86_64', 'enable_net': False, 'fedora_review': False, 'git_hash': '8f047148f34ff9178fdaa68475c237e696e6c2cd', 'git_repo': 'https://copr-dist-git.fedorainfracloud.org/git/dchen/el-pkgs/llama-cpp', 'isolation': 'default', 'memory_reqs': 2048, 'package_name': 'llama-cpp', 'package_version': 'b6153-1', 'project_dirname': 'el-pkgs', 'project_name': 'el-pkgs', 'project_owner': 'dchen', 'repo_priority': None, 'repos': [{'baseurl': 'https://download.copr.fedorainfracloud.org/results/dchen/el-pkgs/rhel+epel-10-x86_64/', 'id': 'copr_base', 'name': 'Copr repository', 'priority': None}], 'sandbox': 'dchen/el-pkgs--https://src.fedoraproject.org/user/trix', 'source_json': {}, 'source_type': None, 'ssh_public_keys': None, 'storage': 0, 'submitter': 'https://src.fedoraproject.org/user/trix', 'tags': [], 'task_id': '9938613-rhel+epel-10-x86_64', 'timeout': 18000, 'uses_devel_repo': False, 'with_opts': [], 'without_opts': []} Running: git clone https://copr-dist-git.fedorainfracloud.org/git/dchen/el-pkgs/llama-cpp /var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp --depth 500 --no-single-branch --recursive cmd: ['git', 'clone', 'https://copr-dist-git.fedorainfracloud.org/git/dchen/el-pkgs/llama-cpp', '/var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp', '--depth', '500', '--no-single-branch', '--recursive'] cwd: . rc: 0 stdout: stderr: Cloning into '/var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp'... Running: git checkout 8f047148f34ff9178fdaa68475c237e696e6c2cd -- cmd: ['git', 'checkout', '8f047148f34ff9178fdaa68475c237e696e6c2cd', '--'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp rc: 0 stdout: stderr: Note: switching to '8f047148f34ff9178fdaa68475c237e696e6c2cd'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by switching back to a branch. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -c with the switch command. Example: git switch -c Or undo this operation with: git switch - Turn off this advice by setting config variable advice.detachedHead to false HEAD is now at 8f04714 automatic import of llama-cpp Running: dist-git-client sources cmd: ['dist-git-client', 'sources'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp rc: 0 stdout: stderr: INFO: Reading stdout from command: git rev-parse --abbrev-ref HEAD INFO: Reading stdout from command: git rev-parse HEAD INFO: Reading sources specification file: sources INFO: Downloading llama.cpp-b6153.tar.gz INFO: Reading stdout from command: curl --help all INFO: Calling: curl -H Pragma: -o llama.cpp-b6153.tar.gz --location --connect-timeout 60 --retry 3 --retry-delay 10 --remote-time --show-error --fail --retry-all-errors https://copr-dist-git.fedorainfracloud.org/repo/pkgs/dchen/el-pkgs/llama-cpp/llama.cpp-b6153.tar.gz/md5/e7eae951975b13b8eed5bb4264c632cc/llama.cpp-b6153.tar.gz % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 24.3M 100 24.3M 0 0 438M 0 --:--:-- --:--:-- --:--:-- 441M INFO: Reading stdout from command: md5sum llama.cpp-b6153.tar.gz tail: /var/lib/copr-rpmbuild/main.log: file truncated Running (timeout=18000): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp/llama-cpp.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1766268696.705688 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 6.6 starting (python version = 3.13.7, NVR = mock-6.6-1.fc42), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp/llama-cpp.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1766268696.705688 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp/llama-cpp.spec) Config(rhel+epel-10-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 6.6 INFO: Mock Version: 6.6 Start: chroot init INFO: mounting tmpfs at /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf4 detected and used (fallback) INFO: Buildroot is handled by package management from host and used with --installroot: rpm-4.20.1-1.fc42.x86_64 rpm-sequoia-1.7.0-5.fc42.x86_64 python3-dnf-4.24.0-1.fc42.noarch python3-dnf-plugins-core-4.10.1-1.fc42.noarch dnf5-5.2.17.0-1.fc42.x86_64 dnf5-plugins-5.2.17.0-1.fc42.x86_64 Start: installing minimal buildroot with dnf No matches found for the following disable plugin patterns: local, spacewalk, versionlock Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Copr repository 56 kB/s | 47 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 122 MB/s | 40 MB 00:00 Red Hat Enterprise Linux 10 for x86_64 - AppStr 15 MB/s | 4.1 MB 00:00 Red Hat CodeReady Linux Builder for RHEL 10 x86 3.4 MB/s | 896 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 16 MB/s | 5.6 MB 00:00 Dependencies resolved. ======================================================================================= Package Arch Version Repo Size ======================================================================================= Installing: bash x86_64 5.2.26-6.el10 baseos 1.8 M bzip2 x86_64 1.0.8-25.el10 baseos 59 k coreutils x86_64 9.5-6.el10 baseos 1.1 M cpio x86_64 2.15-3.el10 baseos 296 k diffutils x86_64 3.10-8.el10 baseos 413 k epel-rpm-macros noarch 10-6.el10_1 epel 8.3 k findutils x86_64 1:4.10.0-5.el10 baseos 555 k gawk x86_64 5.3.0-6.el10 baseos 1.1 M glibc-minimal-langpack x86_64 2.39-58.el10_1.2 baseos 45 k grep x86_64 3.11-10.el10 baseos 305 k gzip x86_64 1.13-3.el10 baseos 174 k info x86_64 7.1-6.el10 baseos 187 k patch x86_64 2.7.6-26.el10 appstream 134 k redhat-release x86_64 10.1-18.el10 baseos 61 k redhat-rpm-config noarch 293-1.el10 appstream 77 k rpm-build x86_64 4.19.1.1-20.el10 appstream 75 k sed x86_64 4.9-3.el10 baseos 322 k shadow-utils x86_64 2:4.15.0-8.el10 baseos 1.3 M tar x86_64 2:1.35-7.el10 baseos 866 k unzip x86_64 6.0-69.el10 baseos 190 k util-linux x86_64 2.40.2-13.el10 baseos 1.3 M which x86_64 2.21-44.el10_0 baseos 42 k xz x86_64 1:5.6.2-4.el10_0 baseos 481 k Installing dependencies: alternatives x86_64 1.30-2.el10 baseos 45 k ansible-srpm-macros noarch 1-16.1.el10_0 epel 20 k audit-libs x86_64 4.0.3-4.el10 baseos 133 k authselect x86_64 1.5.0-8.el10 baseos 148 k authselect-libs x86_64 1.5.0-8.el10 baseos 227 k basesystem noarch 11-22.el10 baseos 8.3 k binutils x86_64 2.41-58.el10_1.2 baseos 6.4 M binutils-gold x86_64 2.41-58.el10_1.2 baseos 797 k bzip2-libs x86_64 1.0.8-25.el10 baseos 43 k ca-certificates noarch 2025.2.80_v9.0.305-102.el10_1 baseos 1.1 M coreutils-common x86_64 9.5-6.el10 baseos 2.2 M cracklib x86_64 2.9.11-8.el10 baseos 100 k cracklib-dicts x86_64 2.9.11-8.el10 baseos 3.7 M crypto-policies noarch 20250905-2.gitc7eb7b2.el10_1 baseos 98 k curl x86_64 8.12.1-2.el10 baseos 219 k cyrus-sasl-lib x86_64 2.1.28-29.el10 baseos 106 k debugedit x86_64 5.1-8.el10 appstream 80 k dwz x86_64 0.16-1.el10 appstream 140 k ed x86_64 1.20-5.el10 baseos 86 k efi-srpm-macros noarch 6-6.el10 appstream 25 k elfutils x86_64 0.193-1.el10 baseos 573 k elfutils-debuginfod-client x86_64 0.193-1.el10 baseos 47 k elfutils-default-yama-scope noarch 0.193-1.el10 baseos 13 k elfutils-libelf x86_64 0.193-1.el10 baseos 208 k elfutils-libs x86_64 0.193-1.el10 baseos 270 k file x86_64 5.45-8.el10 baseos 49 k file-libs x86_64 5.45-8.el10 baseos 764 k filesystem x86_64 3.18-17.el10 baseos 4.8 M fonts-srpm-macros noarch 1:2.0.5-18.el10 appstream 29 k forge-srpm-macros noarch 0.4.0-6.el10 appstream 23 k fpc-srpm-macros noarch 1.3-7.el10_1 epel 7.8 k gdb-minimal x86_64 16.3-2.el10 appstream 4.4 M gdbm x86_64 1:1.23-12.el10_0 baseos 156 k gdbm-libs x86_64 1:1.23-12.el10_0 baseos 60 k ghc-srpm-macros noarch 1.9.2-1.el10_0 epel 9.1 k glibc x86_64 2.39-58.el10_1.2 baseos 2.1 M glibc-common x86_64 2.39-58.el10_1.2 baseos 339 k glibc-gconv-extra x86_64 2.39-58.el10_1.2 baseos 1.7 M gmp x86_64 1:6.2.1-12.el10 baseos 318 k go-srpm-macros noarch 3.6.0-4.el10 appstream 29 k jansson x86_64 2.14-3.el10 baseos 48 k json-c x86_64 0.18-3.el10 baseos 47 k kernel-srpm-macros noarch 1.0-25.el10 appstream 11 k keyutils-libs x86_64 1.6.3-5.el10 baseos 35 k krb5-libs x86_64 1.21.3-8.el10_0 baseos 767 k libacl x86_64 2.3.2-4.el10 baseos 27 k libarchive x86_64 3.7.7-4.el10_0 baseos 414 k libattr x86_64 2.5.2-5.el10 baseos 20 k libblkid x86_64 2.40.2-13.el10 baseos 124 k libbrotli x86_64 1.1.0-6.el10 baseos 349 k libcap x86_64 2.69-7.el10 baseos 95 k libcap-ng x86_64 0.8.4-6.el10 baseos 36 k libcom_err x86_64 1.47.1-4.el10 baseos 27 k libcurl x86_64 8.12.1-2.el10 baseos 371 k libeconf x86_64 0.6.2-4.el10 baseos 36 k libevent x86_64 2.1.12-16.el10 baseos 265 k libfdisk x86_64 2.40.2-13.el10 baseos 159 k libffi x86_64 3.4.4-10.el10 baseos 41 k libgcc x86_64 14.3.1-2.1.el10 baseos 145 k libgomp x86_64 14.3.1-2.1.el10 baseos 368 k libidn2 x86_64 2.3.7-3.el10 baseos 122 k libmount x86_64 2.40.2-13.el10 baseos 155 k libnghttp2 x86_64 1.64.0-2.el10 baseos 80 k libpkgconf x86_64 2.1.0-3.el10 baseos 41 k libpsl x86_64 0.21.5-6.el10 baseos 67 k libpwquality x86_64 1.4.5-12.el10 baseos 127 k libselinux x86_64 3.9-1.el10 baseos 97 k libsemanage x86_64 3.9-1.el10 baseos 122 k libsepol x86_64 3.9-1.el10 baseos 348 k libsmartcols x86_64 2.40.2-13.el10 baseos 83 k libssh x86_64 0.11.1-5.el10_1 baseos 233 k libssh-config noarch 0.11.1-5.el10_1 baseos 8.6 k libstdc++ x86_64 14.3.1-2.1.el10 baseos 924 k libtasn1 x86_64 4.20.0-1.el10 baseos 78 k libunistring x86_64 1.1-10.el10 baseos 550 k libutempter x86_64 1.2.1-15.el10 baseos 30 k libuuid x86_64 2.40.2-13.el10 baseos 28 k libverto x86_64 0.3.2-10.el10 baseos 24 k libxcrypt x86_64 4.4.36-10.el10 baseos 124 k libxml2 x86_64 2.12.5-9.el10_0 baseos 692 k libzstd x86_64 1.5.5-9.el10 baseos 294 k lua-libs x86_64 5.4.6-7.el10 baseos 134 k lua-srpm-macros noarch 1-15.el10 appstream 10 k lz4-libs x86_64 1.9.4-8.el10 baseos 70 k mpfr x86_64 4.2.1-5.el10 baseos 349 k ncurses-base noarch 6.4-14.20240127.el10 baseos 104 k ncurses-libs x86_64 6.4-14.20240127.el10 baseos 342 k ocaml-srpm-macros noarch 10-4.el10 appstream 10 k openblas-srpm-macros noarch 2-19.el10 appstream 9.0 k openldap x86_64 2.6.9-1.el10 baseos 240 k openssl-fips-provider x86_64 3.0.7-8.el10 baseos 9.2 k openssl-fips-provider-so x86_64 3.0.7-8.el10 baseos 576 k openssl-libs x86_64 1:3.5.1-4.el10_1 baseos 2.3 M p11-kit x86_64 0.25.5-7.el10 baseos 501 k p11-kit-trust x86_64 0.25.5-7.el10 baseos 137 k package-notes-srpm-macros noarch 0.5-13.el10 appstream 11 k pam x86_64 1.6.1-8.el10 baseos 586 k pam-libs x86_64 1.6.1-8.el10 baseos 58 k pcre2 x86_64 10.44-1.el10.3 baseos 250 k pcre2-syntax noarch 10.44-1.el10.3 baseos 155 k perl-srpm-macros noarch 1-57.el10 appstream 9.7 k pkgconf x86_64 2.1.0-3.el10 baseos 48 k pkgconf-m4 noarch 2.1.0-3.el10 baseos 15 k pkgconf-pkg-config x86_64 2.1.0-3.el10 baseos 12 k popt x86_64 1.19-8.el10 baseos 70 k publicsuffix-list-dafsa noarch 20240107-5.el10 baseos 60 k pyproject-srpm-macros noarch 1.16.2-1.el10 appstream 16 k python-srpm-macros noarch 3.12-10.el10 appstream 24 k qt6-srpm-macros noarch 6.9.1-1.el10 appstream 11 k readline x86_64 8.2-11.el10 baseos 217 k rpm x86_64 4.19.1.1-20.el10 baseos 560 k rpm-build-libs x86_64 4.19.1.1-20.el10 baseos 93 k rpm-libs x86_64 4.19.1.1-20.el10 baseos 309 k rpm-sequoia x86_64 1.9.0.3-1.el10_1 baseos 968 k rust-toolset-srpm-macros noarch 1.88.0-1.el10 appstream 13 k setup noarch 2.14.5-7.el10 baseos 153 k sqlite-libs x86_64 3.46.1-5.el10_1 baseos 745 k systemd-libs x86_64 257-13.el10 baseos 823 k util-linux-core x86_64 2.40.2-13.el10 baseos 550 k xz-libs x86_64 1:5.6.2-4.el10_0 baseos 113 k zip x86_64 3.0-45.el10 baseos 270 k zlib-ng-compat x86_64 2.2.3-2.el10 baseos 79 k zstd x86_64 1.5.5-9.el10 baseos 468 k Transaction Summary ======================================================================================= Install 146 Packages Total download size: 61 M Installed size: 187 M Downloading Packages: (1/146): alternatives-1.30-2.el10.x86_64.rpm 483 kB/s | 45 kB 00:00 (2/146): authselect-libs-1.5.0-8.el10.x86_64.rp 2.1 MB/s | 227 kB 00:00 (3/146): authselect-1.5.0-8.el10.x86_64.rpm 1.3 MB/s | 148 kB 00:00 (4/146): bzip2-1.0.8-25.el10.x86_64.rpm 790 kB/s | 59 kB 00:00 (5/146): bash-5.2.26-6.el10.x86_64.rpm 20 MB/s | 1.8 MB 00:00 (6/146): basesystem-11-22.el10.noarch.rpm 78 kB/s | 8.3 kB 00:00 (7/146): bzip2-libs-1.0.8-25.el10.x86_64.rpm 597 kB/s | 43 kB 00:00 (8/146): coreutils-common-9.5-6.el10.x86_64.rpm 26 MB/s | 2.2 MB 00:00 (9/146): coreutils-9.5-6.el10.x86_64.rpm 12 MB/s | 1.1 MB 00:00 (10/146): cpio-2.15-3.el10.x86_64.rpm 3.5 MB/s | 296 kB 00:00 (11/146): cracklib-2.9.11-8.el10.x86_64.rpm 1.3 MB/s | 100 kB 00:00 (12/146): cracklib-dicts-2.9.11-8.el10.x86_64.r 38 MB/s | 3.7 MB 00:00 (13/146): diffutils-3.10-8.el10.x86_64.rpm 5.0 MB/s | 413 kB 00:00 (14/146): ed-1.20-5.el10.x86_64.rpm 971 kB/s | 86 kB 00:00 (15/146): findutils-4.10.0-5.el10.x86_64.rpm 7.0 MB/s | 555 kB 00:00 (16/146): gawk-5.3.0-6.el10.x86_64.rpm 13 MB/s | 1.1 MB 00:00 (17/146): gdbm-libs-1.23-12.el10_0.x86_64.rpm 800 kB/s | 60 kB 00:00 (18/146): gdbm-1.23-12.el10_0.x86_64.rpm 1.4 MB/s | 156 kB 00:00 (19/146): grep-3.11-10.el10.x86_64.rpm 3.8 MB/s | 305 kB 00:00 (20/146): gzip-1.13-3.el10.x86_64.rpm 2.2 MB/s | 174 kB 00:00 (21/146): info-7.1-6.el10.x86_64.rpm 2.4 MB/s | 187 kB 00:00 (22/146): jansson-2.14-3.el10.x86_64.rpm 652 kB/s | 48 kB 00:00 (23/146): json-c-0.18-3.el10.x86_64.rpm 628 kB/s | 47 kB 00:00 (24/146): keyutils-libs-1.6.3-5.el10.x86_64.rpm 467 kB/s | 35 kB 00:00 (25/146): libacl-2.3.2-4.el10.x86_64.rpm 366 kB/s | 27 kB 00:00 (26/146): libattr-2.5.2-5.el10.x86_64.rpm 273 kB/s | 20 kB 00:00 (27/146): libbrotli-1.1.0-6.el10.x86_64.rpm 4.7 MB/s | 349 kB 00:00 (28/146): libcap-2.69-7.el10.x86_64.rpm 1.3 MB/s | 95 kB 00:00 (29/146): libcap-ng-0.8.4-6.el10.x86_64.rpm 478 kB/s | 36 kB 00:00 (30/146): libeconf-0.6.2-4.el10.x86_64.rpm 455 kB/s | 36 kB 00:00 (31/146): libevent-2.1.12-16.el10.x86_64.rpm 3.6 MB/s | 265 kB 00:00 (32/146): libidn2-2.3.7-3.el10.x86_64.rpm 1.6 MB/s | 122 kB 00:00 (33/146): libpkgconf-2.1.0-3.el10.x86_64.rpm 553 kB/s | 41 kB 00:00 (34/146): libnghttp2-1.64.0-2.el10.x86_64.rpm 874 kB/s | 80 kB 00:00 (35/146): libpsl-0.21.5-6.el10.x86_64.rpm 843 kB/s | 67 kB 00:00 (36/146): libpwquality-1.4.5-12.el10.x86_64.rpm 1.6 MB/s | 127 kB 00:00 (37/146): libtasn1-4.20.0-1.el10.x86_64.rpm 1.0 MB/s | 78 kB 00:00 (38/146): libunistring-1.1-10.el10.x86_64.rpm 6.5 MB/s | 550 kB 00:00 (39/146): libutempter-1.2.1-15.el10.x86_64.rpm 407 kB/s | 30 kB 00:00 (40/146): libverto-0.3.2-10.el10.x86_64.rpm 319 kB/s | 24 kB 00:00 (41/146): libzstd-1.5.5-9.el10.x86_64.rpm 4.0 MB/s | 294 kB 00:00 (42/146): lua-libs-5.4.6-7.el10.x86_64.rpm 1.8 MB/s | 134 kB 00:00 (43/146): lz4-libs-1.9.4-8.el10.x86_64.rpm 944 kB/s | 70 kB 00:00 (44/146): mpfr-4.2.1-5.el10.x86_64.rpm 4.3 MB/s | 349 kB 00:00 (45/146): libxcrypt-4.4.36-10.el10.x86_64.rpm 602 kB/s | 124 kB 00:00 (46/146): ncurses-base-6.4-14.20240127.el10.noa 1.4 MB/s | 104 kB 00:00 (47/146): ncurses-libs-6.4-14.20240127.el10.x86 4.5 MB/s | 342 kB 00:00 (48/146): p11-kit-0.25.5-7.el10.x86_64.rpm 5.8 MB/s | 501 kB 00:00 (49/146): p11-kit-trust-0.25.5-7.el10.x86_64.rp 1.8 MB/s | 137 kB 00:00 (50/146): pcre2-10.44-1.el10.3.x86_64.rpm 3.2 MB/s | 250 kB 00:00 (51/146): pcre2-syntax-10.44-1.el10.3.noarch.rp 2.0 MB/s | 155 kB 00:00 (52/146): pkgconf-2.1.0-3.el10.x86_64.rpm 675 kB/s | 48 kB 00:00 (53/146): pkgconf-m4-2.1.0-3.el10.noarch.rpm 201 kB/s | 15 kB 00:00 (54/146): pkgconf-pkg-config-2.1.0-3.el10.x86_6 151 kB/s | 12 kB 00:00 (55/146): popt-1.19-8.el10.x86_64.rpm 850 kB/s | 70 kB 00:00 (56/146): publicsuffix-list-dafsa-20240107-5.el 729 kB/s | 60 kB 00:00 (57/146): readline-8.2-11.el10.x86_64.rpm 2.7 MB/s | 217 kB 00:00 (58/146): sed-4.9-3.el10.x86_64.rpm 3.6 MB/s | 322 kB 00:00 (59/146): tar-1.35-7.el10.x86_64.rpm 11 MB/s | 866 kB 00:00 (60/146): zstd-1.5.5-9.el10.x86_64.rpm 5.6 MB/s | 468 kB 00:00 (61/146): krb5-libs-1.21.3-8.el10_0.x86_64.rpm 9.3 MB/s | 767 kB 00:00 (62/146): libarchive-3.7.7-4.el10_0.x86_64.rpm 5.3 MB/s | 414 kB 00:00 (63/146): which-2.21-44.el10_0.x86_64.rpm 568 kB/s | 42 kB 00:00 (64/146): libxml2-2.12.5-9.el10_0.x86_64.rpm 6.6 MB/s | 692 kB 00:00 (65/146): xz-5.6.2-4.el10_0.x86_64.rpm 6.2 MB/s | 481 kB 00:00 (66/146): audit-libs-4.0.3-4.el10.x86_64.rpm 1.8 MB/s | 133 kB 00:00 (67/146): crypto-policies-20250905-2.gitc7eb7b2 1.3 MB/s | 98 kB 00:00 (68/146): xz-libs-5.6.2-4.el10_0.x86_64.rpm 794 kB/s | 113 kB 00:00 (69/146): curl-8.12.1-2.el10.x86_64.rpm 2.9 MB/s | 219 kB 00:00 (70/146): cyrus-sasl-lib-2.1.28-29.el10.x86_64. 1.4 MB/s | 106 kB 00:00 (71/146): elfutils-0.193-1.el10.x86_64.rpm 7.2 MB/s | 573 kB 00:00 (72/146): elfutils-debuginfod-client-0.193-1.el 648 kB/s | 47 kB 00:00 (73/146): elfutils-default-yama-scope-0.193-1.e 174 kB/s | 13 kB 00:00 (74/146): elfutils-libs-0.193-1.el10.x86_64.rpm 3.6 MB/s | 270 kB 00:00 (75/146): elfutils-libelf-0.193-1.el10.x86_64.r 2.5 MB/s | 208 kB 00:00 (76/146): file-5.45-8.el10.x86_64.rpm 673 kB/s | 49 kB 00:00 (77/146): file-libs-5.45-8.el10.x86_64.rpm 9.6 MB/s | 764 kB 00:00 (78/146): gmp-6.2.1-12.el10.x86_64.rpm 4.1 MB/s | 318 kB 00:00 (79/146): filesystem-3.18-17.el10.x86_64.rpm 52 MB/s | 4.8 MB 00:00 (80/146): libcom_err-1.47.1-4.el10.x86_64.rpm 365 kB/s | 27 kB 00:00 (81/146): libblkid-2.40.2-13.el10.x86_64.rpm 1.6 MB/s | 124 kB 00:00 (82/146): libcurl-8.12.1-2.el10.x86_64.rpm 5.0 MB/s | 371 kB 00:00 (83/146): libfdisk-2.40.2-13.el10.x86_64.rpm 2.1 MB/s | 159 kB 00:00 (84/146): libffi-3.4.4-10.el10.x86_64.rpm 529 kB/s | 41 kB 00:00 (85/146): libgcc-14.3.1-2.1.el10.x86_64.rpm 1.9 MB/s | 145 kB 00:00 (86/146): libmount-2.40.2-13.el10.x86_64.rpm 2.0 MB/s | 155 kB 00:00 (87/146): libselinux-3.9-1.el10.x86_64.rpm 1.3 MB/s | 97 kB 00:00 (88/146): libgomp-14.3.1-2.1.el10.x86_64.rpm 3.6 MB/s | 368 kB 00:00 (89/146): libsepol-3.9-1.el10.x86_64.rpm 4.7 MB/s | 348 kB 00:00 (90/146): libsemanage-3.9-1.el10.x86_64.rpm 1.5 MB/s | 122 kB 00:00 (91/146): libsmartcols-2.40.2-13.el10.x86_64.rp 965 kB/s | 83 kB 00:00 (92/146): libstdc++-14.3.1-2.1.el10.x86_64.rpm 12 MB/s | 924 kB 00:00 (93/146): libuuid-2.40.2-13.el10.x86_64.rpm 383 kB/s | 28 kB 00:00 (94/146): openldap-2.6.9-1.el10.x86_64.rpm 3.0 MB/s | 240 kB 00:00 (95/146): openssl-fips-provider-3.0.7-8.el10.x8 125 kB/s | 9.2 kB 00:00 (96/146): pam-1.6.1-8.el10.x86_64.rpm 7.0 MB/s | 586 kB 00:00 (97/146): openssl-fips-provider-so-3.0.7-8.el10 4.8 MB/s | 576 kB 00:00 (98/146): pam-libs-1.6.1-8.el10.x86_64.rpm 770 kB/s | 58 kB 00:00 (99/146): rpm-4.19.1.1-20.el10.x86_64.rpm 7.4 MB/s | 560 kB 00:00 (100/146): rpm-build-libs-4.19.1.1-20.el10.x86_ 1.2 MB/s | 93 kB 00:00 (101/146): rpm-libs-4.19.1.1-20.el10.x86_64.rpm 4.0 MB/s | 309 kB 00:00 (102/146): rpm-sequoia-1.9.0.3-1.el10_1.x86_64. 10 MB/s | 968 kB 00:00 (103/146): shadow-utils-4.15.0-8.el10.x86_64.rp 18 MB/s | 1.3 MB 00:00 (104/146): systemd-libs-257-13.el10.x86_64.rpm 11 MB/s | 823 kB 00:00 (105/146): setup-2.14.5-7.el10.noarch.rpm 790 kB/s | 153 kB 00:00 (106/146): sqlite-libs-3.46.1-5.el10_1.x86_64.r 6.3 MB/s | 745 kB 00:00 (107/146): unzip-6.0-69.el10.x86_64.rpm 2.3 MB/s | 190 kB 00:00 (108/146): util-linux-2.40.2-13.el10.x86_64.rpm 16 MB/s | 1.3 MB 00:00 (109/146): util-linux-core-2.40.2-13.el10.x86_6 6.9 MB/s | 550 kB 00:00 (110/146): zip-3.0-45.el10.x86_64.rpm 3.6 MB/s | 270 kB 00:00 (111/146): zlib-ng-compat-2.2.3-2.el10.x86_64.r 1.0 MB/s | 79 kB 00:00 (112/146): glibc-2.39-58.el10_1.2.x86_64.rpm 25 MB/s | 2.1 MB 00:00 (113/146): glibc-common-2.39-58.el10_1.2.x86_64 4.3 MB/s | 339 kB 00:00 (114/146): glibc-gconv-extra-2.39-58.el10_1.2.x 20 MB/s | 1.7 MB 00:00 (115/146): glibc-minimal-langpack-2.39-58.el10_ 568 kB/s | 45 kB 00:00 (116/146): ca-certificates-2025.2.80_v9.0.305-1 14 MB/s | 1.1 MB 00:00 (117/146): redhat-release-10.1-18.el10.x86_64.r 821 kB/s | 61 kB 00:00 (118/146): openssl-libs-3.5.1-4.el10_1.x86_64.r 27 MB/s | 2.3 MB 00:00 (119/146): libssh-0.11.1-5.el10_1.x86_64.rpm 2.8 MB/s | 233 kB 00:00 (120/146): libssh-config-0.11.1-5.el10_1.noarch 113 kB/s | 8.6 kB 00:00 (121/146): binutils-2.41-58.el10_1.2.x86_64.rpm 65 MB/s | 6.4 MB 00:00 (122/146): binutils-gold-2.41-58.el10_1.2.x86_6 10 MB/s | 797 kB 00:00 (123/146): fonts-srpm-macros-2.0.5-18.el10.noar 385 kB/s | 29 kB 00:00 (124/146): perl-srpm-macros-1-57.el10.noarch.rp 118 kB/s | 9.7 kB 00:00 (125/146): efi-srpm-macros-6-6.el10.noarch.rpm 328 kB/s | 25 kB 00:00 (126/146): lua-srpm-macros-1-15.el10.noarch.rpm 138 kB/s | 10 kB 00:00 (127/146): package-notes-srpm-macros-0.5-13.el1 142 kB/s | 11 kB 00:00 (128/146): ocaml-srpm-macros-10-4.el10.noarch.r 126 kB/s | 10 kB 00:00 (129/146): openblas-srpm-macros-2-19.el10.noarc 118 kB/s | 9.0 kB 00:00 (130/146): go-srpm-macros-3.6.0-4.el10.noarch.r 380 kB/s | 29 kB 00:00 (131/146): kernel-srpm-macros-1.0-25.el10.noarc 148 kB/s | 11 kB 00:00 (132/146): patch-2.7.6-26.el10.x86_64.rpm 1.7 MB/s | 134 kB 00:00 (133/146): pyproject-srpm-macros-1.16.2-1.el10. 208 kB/s | 16 kB 00:00 (134/146): forge-srpm-macros-0.4.0-6.el10.noarc 288 kB/s | 23 kB 00:00 (135/146): python-srpm-macros-3.12-10.el10.noar 300 kB/s | 24 kB 00:00 (136/146): qt6-srpm-macros-6.9.1-1.el10.noarch. 147 kB/s | 11 kB 00:00 (137/146): redhat-rpm-config-293-1.el10.noarch. 962 kB/s | 77 kB 00:00 (138/146): rpm-build-4.19.1.1-20.el10.x86_64.rp 956 kB/s | 75 kB 00:00 (139/146): rust-toolset-srpm-macros-1.88.0-1.el 170 kB/s | 13 kB 00:00 (140/146): dwz-0.16-1.el10.x86_64.rpm 1.8 MB/s | 140 kB 00:00 (141/146): debugedit-5.1-8.el10.x86_64.rpm 767 kB/s | 80 kB 00:00 (142/146): gdb-minimal-16.3-2.el10.x86_64.rpm 51 MB/s | 4.4 MB 00:00 (143/146): epel-rpm-macros-10-6.el10_1.noarch.r 339 kB/s | 8.3 kB 00:00 (144/146): ghc-srpm-macros-1.9.2-1.el10_0.noarc 1.9 MB/s | 9.1 kB 00:00 (145/146): ansible-srpm-macros-1-16.1.el10_0.no 286 kB/s | 20 kB 00:00 (146/146): fpc-srpm-macros-1.3-7.el10_1.noarch. 144 kB/s | 7.8 kB 00:00 -------------------------------------------------------------------------------- Total 14 MB/s | 61 MB 00:04 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 3.6 MB/s | 3.7 kB 00:00 Importing GPG key 0x5A6340B3: Userid : "Red Hat, Inc. (auxiliary key 3) " Fingerprint: 7E46 2425 8C40 6535 D56D 6F13 5054 E4A4 5A63 40B3 From : /usr/share/distribution-gpg-keys/redhat/RPM-GPG-KEY-redhat10-release Key imported successfully Importing GPG key 0xFD431D51: Userid : "Red Hat, Inc. (release key 2) " Fingerprint: 567E 347A D004 4ADE 55BA 8A5F 199E 2F91 FD43 1D51 From : /usr/share/distribution-gpg-keys/redhat/RPM-GPG-KEY-redhat10-release Key imported successfully Extra Packages for Enterprise Linux 10 - x86_64 1.6 MB/s | 1.6 kB 00:00 Importing GPG key 0xE37ED158: Userid : "Fedora (epel10) " Fingerprint: 7D8D 15CB FC4E 6268 8591 FB26 33D9 8517 E37E D158 From : /usr/share/distribution-gpg-keys/epel/RPM-GPG-KEY-EPEL-10 Key imported successfully Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Running scriptlet: filesystem-3.18-17.el10.x86_64 1/1 Preparing : 1/1 Installing : libgcc-14.3.1-2.1.el10.x86_64 1/146 Running scriptlet: libgcc-14.3.1-2.1.el10.x86_64 1/146 Installing : redhat-release-10.1-18.el10.x86_64 2/146 Running scriptlet: setup-2.14.5-7.el10.noarch 3/146 Creating group 'adm' with GID 4. Creating group 'audio' with GID 63. Creating group 'bin' with GID 1. Creating group 'cdrom' with GID 11. Creating group 'clock' with GID 103. Creating group 'daemon' with GID 2. Creating group 'dialout' with GID 18. Creating group 'disk' with GID 6. Creating group 'floppy' with GID 19. Creating group 'ftp' with GID 50. Creating group 'games' with GID 20. Creating group 'kmem' with GID 9. Creating group 'lock' with GID 54. Creating group 'lp' with GID 7. Creating group 'mail' with GID 12. Creating group 'man' with GID 15. Creating group 'mem' with GID 8. Creating group 'nobody' with GID 65534. Creating group 'root' with GID 0. Creating group 'sys' with GID 3. Creating group 'tape' with GID 33. Creating group 'tty' with GID 5. Creating group 'users' with GID 100. Creating group 'video' with GID 39. Creating group 'wheel' with GID 10. Creating user 'adm' (adm) with UID 3 and GID 4. Creating user 'bin' (bin) with UID 1 and GID 1. Creating user 'daemon' (daemon) with UID 2 and GID 2. Creating user 'ftp' (FTP User) with UID 14 and GID 50. Creating user 'games' (games) with UID 12 and GID 20. Creating user 'halt' (halt) with UID 7 and GID 0. Creating user 'lp' (lp) with UID 4 and GID 7. Creating user 'mail' (mail) with UID 8 and GID 12. Creating user 'nobody' (Kernel Overflow User) with UID 65534 and GID 65534. Creating user 'operator' (operator) with UID 11 and GID 0. Creating user 'root' (Super User) with UID 0 and GID 0. Creating user 'shutdown' (shutdown) with UID 6 and GID 0. Creating user 'sync' (sync) with UID 5 and GID 0. Installing : setup-2.14.5-7.el10.noarch 3/146 warning: /etc/hosts created as /etc/hosts.rpmnew Running scriptlet: setup-2.14.5-7.el10.noarch 3/146 Installing : filesystem-3.18-17.el10.x86_64 4/146 Installing : basesystem-11-22.el10.noarch 5/146 Installing : ghc-srpm-macros-1.9.2-1.el10_0.noarch 6/146 Installing : fpc-srpm-macros-1.3-7.el10_1.noarch 7/146 Installing : ansible-srpm-macros-1-16.1.el10_0.noarch 8/146 Installing : rust-toolset-srpm-macros-1.88.0-1.el10.noarch 9/146 Installing : qt6-srpm-macros-6.9.1-1.el10.noarch 10/146 Installing : kernel-srpm-macros-1.0-25.el10.noarch 11/146 Installing : openblas-srpm-macros-2-19.el10.noarch 12/146 Installing : ocaml-srpm-macros-10-4.el10.noarch 13/146 Installing : package-notes-srpm-macros-0.5-13.el10.noarch 14/146 Installing : perl-srpm-macros-1-57.el10.noarch 15/146 Installing : libssh-config-0.11.1-5.el10_1.noarch 16/146 Installing : publicsuffix-list-dafsa-20240107-5.el10.noarch 17/146 Installing : pkgconf-m4-2.1.0-3.el10.noarch 18/146 Installing : pcre2-syntax-10.44-1.el10.3.noarch 19/146 Installing : ncurses-base-6.4-14.20240127.el10.noarch 20/146 Installing : bash-5.2.26-6.el10.x86_64 21/146 Running scriptlet: bash-5.2.26-6.el10.x86_64 21/146 Installing : ncurses-libs-6.4-14.20240127.el10.x86_64 22/146 Installing : glibc-common-2.39-58.el10_1.2.x86_64 23/146 Installing : glibc-gconv-extra-2.39-58.el10_1.2.x86_64 24/146 Running scriptlet: glibc-gconv-extra-2.39-58.el10_1.2.x86_64 24/146 Installing : glibc-minimal-langpack-2.39-58.el10_1.2.x86_64 25/146 Running scriptlet: glibc-2.39-58.el10_1.2.x86_64 26/146 Installing : glibc-2.39-58.el10_1.2.x86_64 26/146 Running scriptlet: glibc-2.39-58.el10_1.2.x86_64 26/146 Installing : zlib-ng-compat-2.2.3-2.el10.x86_64 27/146 Installing : bzip2-libs-1.0.8-25.el10.x86_64 28/146 Installing : xz-libs-1:5.6.2-4.el10_0.x86_64 29/146 Installing : popt-1.19-8.el10.x86_64 30/146 Installing : readline-8.2-11.el10.x86_64 31/146 Installing : libstdc++-14.3.1-2.1.el10.x86_64 32/146 Installing : libuuid-2.40.2-13.el10.x86_64 33/146 Installing : libblkid-2.40.2-13.el10.x86_64 34/146 Installing : libattr-2.5.2-5.el10.x86_64 35/146 Installing : libacl-2.3.2-4.el10.x86_64 36/146 Installing : libxcrypt-4.4.36-10.el10.x86_64 37/146 Installing : libzstd-1.5.5-9.el10.x86_64 38/146 Installing : elfutils-libelf-0.193-1.el10.x86_64 39/146 Installing : gmp-1:6.2.1-12.el10.x86_64 40/146 Installing : gdbm-libs-1:1.23-12.el10_0.x86_64 41/146 Installing : libeconf-0.6.2-4.el10.x86_64 42/146 Installing : mpfr-4.2.1-5.el10.x86_64 43/146 Installing : gawk-5.3.0-6.el10.x86_64 44/146 Installing : dwz-0.16-1.el10.x86_64 45/146 Installing : unzip-6.0-69.el10.x86_64 46/146 Installing : file-libs-5.45-8.el10.x86_64 47/146 Installing : file-5.45-8.el10.x86_64 48/146 Installing : alternatives-1.30-2.el10.x86_64 49/146 Installing : jansson-2.14-3.el10.x86_64 50/146 Installing : libcap-ng-0.8.4-6.el10.x86_64 51/146 Installing : audit-libs-4.0.3-4.el10.x86_64 52/146 Installing : pam-libs-1.6.1-8.el10.x86_64 53/146 Installing : libcap-2.69-7.el10.x86_64 54/146 Installing : systemd-libs-257-13.el10.x86_64 55/146 Installing : libtasn1-4.20.0-1.el10.x86_64 56/146 Installing : libunistring-1.1-10.el10.x86_64 57/146 Installing : libidn2-2.3.7-3.el10.x86_64 58/146 Installing : lua-libs-5.4.6-7.el10.x86_64 59/146 Installing : lz4-libs-1.9.4-8.el10.x86_64 60/146 Installing : pcre2-10.44-1.el10.3.x86_64 61/146 Installing : grep-3.11-10.el10.x86_64 62/146 Installing : xz-1:5.6.2-4.el10_0.x86_64 63/146 Installing : libffi-3.4.4-10.el10.x86_64 64/146 Installing : libsepol-3.9-1.el10.x86_64 65/146 Installing : libselinux-3.9-1.el10.x86_64 66/146 Installing : sed-4.9-3.el10.x86_64 67/146 Installing : findutils-1:4.10.0-5.el10.x86_64 68/146 Installing : libmount-2.40.2-13.el10.x86_64 69/146 Installing : libsmartcols-2.40.2-13.el10.x86_64 70/146 Running scriptlet: crypto-policies-20250905-2.gitc7eb7b2.el10_1.noa 71/146 Installing : crypto-policies-20250905-2.gitc7eb7b2.el10_1.noa 71/146 Running scriptlet: crypto-policies-20250905-2.gitc7eb7b2.el10_1.noa 71/146 Installing : util-linux-core-2.40.2-13.el10.x86_64 72/146 Installing : tar-2:1.35-7.el10.x86_64 73/146 Installing : libsemanage-3.9-1.el10.x86_64 74/146 Installing : shadow-utils-2:4.15.0-8.el10.x86_64 75/146 Running scriptlet: libutempter-1.2.1-15.el10.x86_64 76/146 Installing : libutempter-1.2.1-15.el10.x86_64 76/146 Installing : p11-kit-0.25.5-7.el10.x86_64 77/146 Installing : p11-kit-trust-0.25.5-7.el10.x86_64 78/146 Running scriptlet: p11-kit-trust-0.25.5-7.el10.x86_64 78/146 Installing : zstd-1.5.5-9.el10.x86_64 79/146 Installing : libpsl-0.21.5-6.el10.x86_64 80/146 Installing : zip-3.0-45.el10.x86_64 81/146 Installing : gdbm-1:1.23-12.el10_0.x86_64 82/146 Installing : cyrus-sasl-lib-2.1.28-29.el10.x86_64 83/146 Installing : libfdisk-2.40.2-13.el10.x86_64 84/146 Installing : libxml2-2.12.5-9.el10_0.x86_64 85/146 Installing : bzip2-1.0.8-25.el10.x86_64 86/146 Installing : sqlite-libs-3.46.1-5.el10_1.x86_64 87/146 Installing : cpio-2.15-3.el10.x86_64 88/146 Installing : diffutils-3.10-8.el10.x86_64 89/146 Installing : ed-1.20-5.el10.x86_64 90/146 Installing : patch-2.7.6-26.el10.x86_64 91/146 Installing : json-c-0.18-3.el10.x86_64 92/146 Installing : keyutils-libs-1.6.3-5.el10.x86_64 93/146 Installing : libbrotli-1.1.0-6.el10.x86_64 94/146 Installing : libnghttp2-1.64.0-2.el10.x86_64 95/146 Installing : libpkgconf-2.1.0-3.el10.x86_64 96/146 Installing : pkgconf-2.1.0-3.el10.x86_64 97/146 Installing : pkgconf-pkg-config-2.1.0-3.el10.x86_64 98/146 Installing : libverto-0.3.2-10.el10.x86_64 99/146 Installing : libcom_err-1.47.1-4.el10.x86_64 100/146 Installing : libgomp-14.3.1-2.1.el10.x86_64 101/146 Installing : elfutils-default-yama-scope-0.193-1.el10.noarch 102/146 Running scriptlet: elfutils-default-yama-scope-0.193-1.el10.noarch 102/146 Installing : elfutils-libs-0.193-1.el10.x86_64 103/146 Installing : coreutils-common-9.5-6.el10.x86_64 104/146 Installing : openssl-fips-provider-so-3.0.7-8.el10.x86_64 105/146 Installing : openssl-fips-provider-3.0.7-8.el10.x86_64 106/146 Installing : openssl-libs-1:3.5.1-4.el10_1.x86_64 107/146 Installing : coreutils-9.5-6.el10.x86_64 108/146 Running scriptlet: ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 109/146 Installing : ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 109/146 Running scriptlet: ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 109/146 Installing : authselect-libs-1.5.0-8.el10.x86_64 110/146 Installing : gzip-1.13-3.el10.x86_64 111/146 Installing : cracklib-2.9.11-8.el10.x86_64 112/146 Installing : krb5-libs-1.21.3-8.el10_0.x86_64 113/146 Installing : libarchive-3.7.7-4.el10_0.x86_64 114/146 Installing : libssh-0.11.1-5.el10_1.x86_64 115/146 Installing : cracklib-dicts-2.9.11-8.el10.x86_64 116/146 Installing : libpwquality-1.4.5-12.el10.x86_64 117/146 Installing : pam-1.6.1-8.el10.x86_64 118/146 Installing : libevent-2.1.12-16.el10.x86_64 119/146 Installing : openldap-2.6.9-1.el10.x86_64 120/146 Installing : libcurl-8.12.1-2.el10.x86_64 121/146 Installing : elfutils-debuginfod-client-0.193-1.el10.x86_64 122/146 Installing : binutils-gold-2.41-58.el10_1.2.x86_64 123/146 Running scriptlet: binutils-gold-2.41-58.el10_1.2.x86_64 123/146 Installing : binutils-2.41-58.el10_1.2.x86_64 124/146 Running scriptlet: binutils-2.41-58.el10_1.2.x86_64 124/146 Installing : elfutils-0.193-1.el10.x86_64 125/146 Installing : gdb-minimal-16.3-2.el10.x86_64 126/146 Installing : debugedit-5.1-8.el10.x86_64 127/146 Installing : curl-8.12.1-2.el10.x86_64 128/146 Installing : rpm-sequoia-1.9.0.3-1.el10_1.x86_64 129/146 Installing : rpm-libs-4.19.1.1-20.el10.x86_64 130/146 Running scriptlet: rpm-4.19.1.1-20.el10.x86_64 131/146 Installing : rpm-4.19.1.1-20.el10.x86_64 131/146 Installing : efi-srpm-macros-6-6.el10.noarch 132/146 Installing : lua-srpm-macros-1-15.el10.noarch 133/146 Installing : rpm-build-libs-4.19.1.1-20.el10.x86_64 134/146 Installing : go-srpm-macros-3.6.0-4.el10.noarch 135/146 Installing : fonts-srpm-macros-1:2.0.5-18.el10.noarch 136/146 Installing : forge-srpm-macros-0.4.0-6.el10.noarch 137/146 Installing : python-srpm-macros-3.12-10.el10.noarch 138/146 Installing : redhat-rpm-config-293-1.el10.noarch 139/146 Installing : rpm-build-4.19.1.1-20.el10.x86_64 140/146 Installing : pyproject-srpm-macros-1.16.2-1.el10.noarch 141/146 Installing : util-linux-2.40.2-13.el10.x86_64 142/146 Running scriptlet: util-linux-2.40.2-13.el10.x86_64 142/146 Installing : authselect-1.5.0-8.el10.x86_64 143/146 Installing : which-2.21-44.el10_0.x86_64 144/146 Installing : info-7.1-6.el10.x86_64 145/146 Installing : epel-rpm-macros-10-6.el10_1.noarch 146/146 Running scriptlet: filesystem-3.18-17.el10.x86_64 146/146 Running scriptlet: ca-certificates-2025.2.80_v9.0.305-102.el10_1.no 146/146 Running scriptlet: authselect-libs-1.5.0-8.el10.x86_64 146/146 Running scriptlet: rpm-4.19.1.1-20.el10.x86_64 146/146 Running scriptlet: epel-rpm-macros-10-6.el10_1.noarch 146/146 Installed products updated. Installed: alternatives-1.30-2.el10.x86_64 ansible-srpm-macros-1-16.1.el10_0.noarch audit-libs-4.0.3-4.el10.x86_64 authselect-1.5.0-8.el10.x86_64 authselect-libs-1.5.0-8.el10.x86_64 basesystem-11-22.el10.noarch bash-5.2.26-6.el10.x86_64 binutils-2.41-58.el10_1.2.x86_64 binutils-gold-2.41-58.el10_1.2.x86_64 bzip2-1.0.8-25.el10.x86_64 bzip2-libs-1.0.8-25.el10.x86_64 ca-certificates-2025.2.80_v9.0.305-102.el10_1.noarch coreutils-9.5-6.el10.x86_64 coreutils-common-9.5-6.el10.x86_64 cpio-2.15-3.el10.x86_64 cracklib-2.9.11-8.el10.x86_64 cracklib-dicts-2.9.11-8.el10.x86_64 crypto-policies-20250905-2.gitc7eb7b2.el10_1.noarch curl-8.12.1-2.el10.x86_64 cyrus-sasl-lib-2.1.28-29.el10.x86_64 debugedit-5.1-8.el10.x86_64 diffutils-3.10-8.el10.x86_64 dwz-0.16-1.el10.x86_64 ed-1.20-5.el10.x86_64 efi-srpm-macros-6-6.el10.noarch elfutils-0.193-1.el10.x86_64 elfutils-debuginfod-client-0.193-1.el10.x86_64 elfutils-default-yama-scope-0.193-1.el10.noarch elfutils-libelf-0.193-1.el10.x86_64 elfutils-libs-0.193-1.el10.x86_64 epel-rpm-macros-10-6.el10_1.noarch file-5.45-8.el10.x86_64 file-libs-5.45-8.el10.x86_64 filesystem-3.18-17.el10.x86_64 findutils-1:4.10.0-5.el10.x86_64 fonts-srpm-macros-1:2.0.5-18.el10.noarch forge-srpm-macros-0.4.0-6.el10.noarch fpc-srpm-macros-1.3-7.el10_1.noarch gawk-5.3.0-6.el10.x86_64 gdb-minimal-16.3-2.el10.x86_64 gdbm-1:1.23-12.el10_0.x86_64 gdbm-libs-1:1.23-12.el10_0.x86_64 ghc-srpm-macros-1.9.2-1.el10_0.noarch glibc-2.39-58.el10_1.2.x86_64 glibc-common-2.39-58.el10_1.2.x86_64 glibc-gconv-extra-2.39-58.el10_1.2.x86_64 glibc-minimal-langpack-2.39-58.el10_1.2.x86_64 gmp-1:6.2.1-12.el10.x86_64 go-srpm-macros-3.6.0-4.el10.noarch grep-3.11-10.el10.x86_64 gzip-1.13-3.el10.x86_64 info-7.1-6.el10.x86_64 jansson-2.14-3.el10.x86_64 json-c-0.18-3.el10.x86_64 kernel-srpm-macros-1.0-25.el10.noarch keyutils-libs-1.6.3-5.el10.x86_64 krb5-libs-1.21.3-8.el10_0.x86_64 libacl-2.3.2-4.el10.x86_64 libarchive-3.7.7-4.el10_0.x86_64 libattr-2.5.2-5.el10.x86_64 libblkid-2.40.2-13.el10.x86_64 libbrotli-1.1.0-6.el10.x86_64 libcap-2.69-7.el10.x86_64 libcap-ng-0.8.4-6.el10.x86_64 libcom_err-1.47.1-4.el10.x86_64 libcurl-8.12.1-2.el10.x86_64 libeconf-0.6.2-4.el10.x86_64 libevent-2.1.12-16.el10.x86_64 libfdisk-2.40.2-13.el10.x86_64 libffi-3.4.4-10.el10.x86_64 libgcc-14.3.1-2.1.el10.x86_64 libgomp-14.3.1-2.1.el10.x86_64 libidn2-2.3.7-3.el10.x86_64 libmount-2.40.2-13.el10.x86_64 libnghttp2-1.64.0-2.el10.x86_64 libpkgconf-2.1.0-3.el10.x86_64 libpsl-0.21.5-6.el10.x86_64 libpwquality-1.4.5-12.el10.x86_64 libselinux-3.9-1.el10.x86_64 libsemanage-3.9-1.el10.x86_64 libsepol-3.9-1.el10.x86_64 libsmartcols-2.40.2-13.el10.x86_64 libssh-0.11.1-5.el10_1.x86_64 libssh-config-0.11.1-5.el10_1.noarch libstdc++-14.3.1-2.1.el10.x86_64 libtasn1-4.20.0-1.el10.x86_64 libunistring-1.1-10.el10.x86_64 libutempter-1.2.1-15.el10.x86_64 libuuid-2.40.2-13.el10.x86_64 libverto-0.3.2-10.el10.x86_64 libxcrypt-4.4.36-10.el10.x86_64 libxml2-2.12.5-9.el10_0.x86_64 libzstd-1.5.5-9.el10.x86_64 lua-libs-5.4.6-7.el10.x86_64 lua-srpm-macros-1-15.el10.noarch lz4-libs-1.9.4-8.el10.x86_64 mpfr-4.2.1-5.el10.x86_64 ncurses-base-6.4-14.20240127.el10.noarch ncurses-libs-6.4-14.20240127.el10.x86_64 ocaml-srpm-macros-10-4.el10.noarch openblas-srpm-macros-2-19.el10.noarch openldap-2.6.9-1.el10.x86_64 openssl-fips-provider-3.0.7-8.el10.x86_64 openssl-fips-provider-so-3.0.7-8.el10.x86_64 openssl-libs-1:3.5.1-4.el10_1.x86_64 p11-kit-0.25.5-7.el10.x86_64 p11-kit-trust-0.25.5-7.el10.x86_64 package-notes-srpm-macros-0.5-13.el10.noarch pam-1.6.1-8.el10.x86_64 pam-libs-1.6.1-8.el10.x86_64 patch-2.7.6-26.el10.x86_64 pcre2-10.44-1.el10.3.x86_64 pcre2-syntax-10.44-1.el10.3.noarch perl-srpm-macros-1-57.el10.noarch pkgconf-2.1.0-3.el10.x86_64 pkgconf-m4-2.1.0-3.el10.noarch pkgconf-pkg-config-2.1.0-3.el10.x86_64 popt-1.19-8.el10.x86_64 publicsuffix-list-dafsa-20240107-5.el10.noarch pyproject-srpm-macros-1.16.2-1.el10.noarch python-srpm-macros-3.12-10.el10.noarch qt6-srpm-macros-6.9.1-1.el10.noarch readline-8.2-11.el10.x86_64 redhat-release-10.1-18.el10.x86_64 redhat-rpm-config-293-1.el10.noarch rpm-4.19.1.1-20.el10.x86_64 rpm-build-4.19.1.1-20.el10.x86_64 rpm-build-libs-4.19.1.1-20.el10.x86_64 rpm-libs-4.19.1.1-20.el10.x86_64 rpm-sequoia-1.9.0.3-1.el10_1.x86_64 rust-toolset-srpm-macros-1.88.0-1.el10.noarch sed-4.9-3.el10.x86_64 setup-2.14.5-7.el10.noarch shadow-utils-2:4.15.0-8.el10.x86_64 sqlite-libs-3.46.1-5.el10_1.x86_64 systemd-libs-257-13.el10.x86_64 tar-2:1.35-7.el10.x86_64 unzip-6.0-69.el10.x86_64 util-linux-2.40.2-13.el10.x86_64 util-linux-core-2.40.2-13.el10.x86_64 which-2.21-44.el10_0.x86_64 xz-1:5.6.2-4.el10_0.x86_64 xz-libs-1:5.6.2-4.el10_0.x86_64 zip-3.0-45.el10.x86_64 zlib-ng-compat-2.2.3-2.el10.x86_64 zstd-1.5.5-9.el10.x86_64 Complete! Finish: installing minimal buildroot with dnf Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: alternatives-1.30-2.el10.x86_64 ansible-srpm-macros-1-16.1.el10_0.noarch audit-libs-4.0.3-4.el10.x86_64 authselect-1.5.0-8.el10.x86_64 authselect-libs-1.5.0-8.el10.x86_64 basesystem-11-22.el10.noarch bash-5.2.26-6.el10.x86_64 binutils-2.41-58.el10_1.2.x86_64 binutils-gold-2.41-58.el10_1.2.x86_64 bzip2-1.0.8-25.el10.x86_64 bzip2-libs-1.0.8-25.el10.x86_64 ca-certificates-2025.2.80_v9.0.305-102.el10_1.noarch coreutils-9.5-6.el10.x86_64 coreutils-common-9.5-6.el10.x86_64 cpio-2.15-3.el10.x86_64 cracklib-2.9.11-8.el10.x86_64 cracklib-dicts-2.9.11-8.el10.x86_64 crypto-policies-20250905-2.gitc7eb7b2.el10_1.noarch curl-8.12.1-2.el10.x86_64 cyrus-sasl-lib-2.1.28-29.el10.x86_64 debugedit-5.1-8.el10.x86_64 diffutils-3.10-8.el10.x86_64 dwz-0.16-1.el10.x86_64 ed-1.20-5.el10.x86_64 efi-srpm-macros-6-6.el10.noarch elfutils-0.193-1.el10.x86_64 elfutils-debuginfod-client-0.193-1.el10.x86_64 elfutils-default-yama-scope-0.193-1.el10.noarch elfutils-libelf-0.193-1.el10.x86_64 elfutils-libs-0.193-1.el10.x86_64 epel-rpm-macros-10-6.el10_1.noarch file-5.45-8.el10.x86_64 file-libs-5.45-8.el10.x86_64 filesystem-3.18-17.el10.x86_64 findutils-4.10.0-5.el10.x86_64 fonts-srpm-macros-2.0.5-18.el10.noarch forge-srpm-macros-0.4.0-6.el10.noarch fpc-srpm-macros-1.3-7.el10_1.noarch gawk-5.3.0-6.el10.x86_64 gdb-minimal-16.3-2.el10.x86_64 gdbm-1.23-12.el10_0.x86_64 gdbm-libs-1.23-12.el10_0.x86_64 ghc-srpm-macros-1.9.2-1.el10_0.noarch glibc-2.39-58.el10_1.2.x86_64 glibc-common-2.39-58.el10_1.2.x86_64 glibc-gconv-extra-2.39-58.el10_1.2.x86_64 glibc-minimal-langpack-2.39-58.el10_1.2.x86_64 gmp-6.2.1-12.el10.x86_64 go-srpm-macros-3.6.0-4.el10.noarch gpg-pubkey-5a6340b3-6229229e gpg-pubkey-e37ed158-65785fa9 gpg-pubkey-fd431d51-4ae0493b grep-3.11-10.el10.x86_64 gzip-1.13-3.el10.x86_64 info-7.1-6.el10.x86_64 jansson-2.14-3.el10.x86_64 json-c-0.18-3.el10.x86_64 kernel-srpm-macros-1.0-25.el10.noarch keyutils-libs-1.6.3-5.el10.x86_64 krb5-libs-1.21.3-8.el10_0.x86_64 libacl-2.3.2-4.el10.x86_64 libarchive-3.7.7-4.el10_0.x86_64 libattr-2.5.2-5.el10.x86_64 libblkid-2.40.2-13.el10.x86_64 libbrotli-1.1.0-6.el10.x86_64 libcap-2.69-7.el10.x86_64 libcap-ng-0.8.4-6.el10.x86_64 libcom_err-1.47.1-4.el10.x86_64 libcurl-8.12.1-2.el10.x86_64 libeconf-0.6.2-4.el10.x86_64 libevent-2.1.12-16.el10.x86_64 libfdisk-2.40.2-13.el10.x86_64 libffi-3.4.4-10.el10.x86_64 libgcc-14.3.1-2.1.el10.x86_64 libgomp-14.3.1-2.1.el10.x86_64 libidn2-2.3.7-3.el10.x86_64 libmount-2.40.2-13.el10.x86_64 libnghttp2-1.64.0-2.el10.x86_64 libpkgconf-2.1.0-3.el10.x86_64 libpsl-0.21.5-6.el10.x86_64 libpwquality-1.4.5-12.el10.x86_64 libselinux-3.9-1.el10.x86_64 libsemanage-3.9-1.el10.x86_64 libsepol-3.9-1.el10.x86_64 libsmartcols-2.40.2-13.el10.x86_64 libssh-0.11.1-5.el10_1.x86_64 libssh-config-0.11.1-5.el10_1.noarch libstdc++-14.3.1-2.1.el10.x86_64 libtasn1-4.20.0-1.el10.x86_64 libunistring-1.1-10.el10.x86_64 libutempter-1.2.1-15.el10.x86_64 libuuid-2.40.2-13.el10.x86_64 libverto-0.3.2-10.el10.x86_64 libxcrypt-4.4.36-10.el10.x86_64 libxml2-2.12.5-9.el10_0.x86_64 libzstd-1.5.5-9.el10.x86_64 lua-libs-5.4.6-7.el10.x86_64 lua-srpm-macros-1-15.el10.noarch lz4-libs-1.9.4-8.el10.x86_64 mpfr-4.2.1-5.el10.x86_64 ncurses-base-6.4-14.20240127.el10.noarch ncurses-libs-6.4-14.20240127.el10.x86_64 ocaml-srpm-macros-10-4.el10.noarch openblas-srpm-macros-2-19.el10.noarch openldap-2.6.9-1.el10.x86_64 openssl-fips-provider-3.0.7-8.el10.x86_64 openssl-fips-provider-so-3.0.7-8.el10.x86_64 openssl-libs-3.5.1-4.el10_1.x86_64 p11-kit-0.25.5-7.el10.x86_64 p11-kit-trust-0.25.5-7.el10.x86_64 package-notes-srpm-macros-0.5-13.el10.noarch pam-1.6.1-8.el10.x86_64 pam-libs-1.6.1-8.el10.x86_64 patch-2.7.6-26.el10.x86_64 pcre2-10.44-1.el10.3.x86_64 pcre2-syntax-10.44-1.el10.3.noarch perl-srpm-macros-1-57.el10.noarch pkgconf-2.1.0-3.el10.x86_64 pkgconf-m4-2.1.0-3.el10.noarch pkgconf-pkg-config-2.1.0-3.el10.x86_64 popt-1.19-8.el10.x86_64 publicsuffix-list-dafsa-20240107-5.el10.noarch pyproject-srpm-macros-1.16.2-1.el10.noarch python-srpm-macros-3.12-10.el10.noarch qt6-srpm-macros-6.9.1-1.el10.noarch readline-8.2-11.el10.x86_64 redhat-release-10.1-18.el10.x86_64 redhat-rpm-config-293-1.el10.noarch rpm-4.19.1.1-20.el10.x86_64 rpm-build-4.19.1.1-20.el10.x86_64 rpm-build-libs-4.19.1.1-20.el10.x86_64 rpm-libs-4.19.1.1-20.el10.x86_64 rpm-sequoia-1.9.0.3-1.el10_1.x86_64 rust-toolset-srpm-macros-1.88.0-1.el10.noarch sed-4.9-3.el10.x86_64 setup-2.14.5-7.el10.noarch shadow-utils-4.15.0-8.el10.x86_64 sqlite-libs-3.46.1-5.el10_1.x86_64 systemd-libs-257-13.el10.x86_64 tar-1.35-7.el10.x86_64 unzip-6.0-69.el10.x86_64 util-linux-2.40.2-13.el10.x86_64 util-linux-core-2.40.2-13.el10.x86_64 which-2.21-44.el10_0.x86_64 xz-5.6.2-4.el10_0.x86_64 xz-libs-5.6.2-4.el10_0.x86_64 zip-3.0-45.el10.x86_64 zlib-ng-compat-2.2.3-2.el10.x86_64 zstd-1.5.5-9.el10.x86_64 Start: buildsrpm Start: rpmbuild -bs Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.el10.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root/var/log/dnf.rpm.log /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root/var/log/dnf.librepo.log /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root/var/log/dnf.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-k0p141h_/llama-cpp/llama-cpp.spec) Config(child) 0 minutes 22 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/llama-cpp-b6153-1.el10.src.rpm) Config(rhel+epel-10-x86_64) Start: chroot init INFO: mounting tmpfs at /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management from host and used with --installroot: rpm-4.20.1-1.fc42.x86_64 rpm-sequoia-1.7.0-5.fc42.x86_64 python3-dnf-4.24.0-1.fc42.noarch python3-dnf-plugins-core-4.10.1-1.fc42.noarch dnf5-5.2.17.0-1.fc42.x86_64 dnf5-plugins-5.2.17.0-1.fc42.x86_64 Finish: chroot init Start: build phase for llama-cpp-b6153-1.el10.src.rpm Start: build setup for llama-cpp-b6153-1.el10.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.el10.src.rpm No matches found for the following disable plugin patterns: local, spacewalk, versionlock Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Copr repository 77 kB/s | 1.5 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 105 kB/s | 4.1 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - AppStr 64 kB/s | 4.1 kB 00:00 Red Hat CodeReady Linux Builder for RHEL 10 x86 98 kB/s | 4.0 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 60 kB/s | 33 kB 00:00 Package curl-8.12.1-2.el10.x86_64 is already installed. Dependencies resolved. ============================================================================================= Package Arch Version Repo Size ============================================================================================= Installing: cmake x86_64 3.30.5-3.el10_0 appstream 12 M gcc-c++ x86_64 14.3.1-2.1.el10 appstream 15 M git x86_64 2.47.3-1.el10_0 appstream 51 k hipblas-devel x86_64 6.4.1-2.el10_1 epel 106 k hipcc-libomp-devel x86_64 20-9.rocm7.1.1.el10 copr_base 15 k langpacks-en noarch 4.1-3.el10 appstream 12 k libcurl-devel x86_64 8.12.1-2.el10 appstream 948 k openmpi x86_64 2:5.0.2-5.el10 appstream 2.1 M pthreadpool-devel x86_64 0.0^git20230829.4fe0e1e-7.el10_1 epel 15 k rocblas-devel x86_64 6.4.2-7.el10_1 epel 108 k rocm-comgr-devel x86_64 20-9.rocm7.1.1.el10 copr_base 33 k rocm-hip-devel x86_64 6.4.2-1.el10_1 epel 233 k rocm-rpm-macros noarch 6.4.2-1.el10_1 epel 16 k rocm-runtime-devel x86_64 6.4.2-1.el10_1 epel 93 k wget x86_64 1.24.5-5.el10 appstream 807 k xxd x86_64 2:9.1.083-6.el10_1 appstream 31 k Installing dependencies: annobin-docs noarch 12.99-1.el10 appstream 88 k annobin-plugin-gcc x86_64 12.99-1.el10 appstream 996 k brotli x86_64 1.1.0-6.el10 appstream 22 k brotli-devel x86_64 1.1.0-6.el10 appstream 39 k clang-resource-filesystem x86_64 20.1.8-1.el10 appstream 17 k cmake-data noarch 3.30.5-3.el10_0 appstream 2.5 M cmake-filesystem x86_64 3.30.5-3.el10_0 appstream 24 k cmake-rpm-macros noarch 3.30.5-3.el10_0 appstream 16 k cpp x86_64 14.3.1-2.1.el10 appstream 13 M dbus x86_64 1:1.14.10-5.el10 baseos 8.5 k dbus-broker x86_64 36-4.el10 baseos 174 k dbus-common noarch 1:1.14.10-5.el10 baseos 19 k default-fonts-core-sans noarch 4.1-3.el10 baseos 34 k emacs-filesystem noarch 1:29.4-12.el10 appstream 10 k environment-modules x86_64 5.3.1-8.el10 baseos 711 k expat x86_64 2.7.1-1.el10_1.3 baseos 119 k fonts-filesystem noarch 1:2.0.5-18.el10 baseos 9.9 k gcc x86_64 14.3.1-2.1.el10 appstream 38 M gcc-plugin-annobin x86_64 14.3.1-2.1.el10 appstream 68 k git-core x86_64 2.47.3-1.el10_0 appstream 4.9 M git-core-doc noarch 2.47.3-1.el10_0 appstream 3.1 M glibc-devel x86_64 2.39-58.el10_1.2 appstream 602 k gnutls x86_64 3.8.10-2.el10 baseos 1.5 M google-noto-fonts-common noarch 20240401-5.el10 baseos 19 k google-noto-sans-mono-vf-fonts noarch 20240401-5.el10 baseos 282 k google-noto-sans-vf-fonts noarch 20240401-5.el10 baseos 596 k google-noto-serif-vf-fonts noarch 20240401-5.el10 baseos 648 k groff-base x86_64 1.23.0-10.el10 baseos 1.1 M hipblas x86_64 6.4.1-2.el10_1 epel 163 k hipblas-common-devel noarch 6.4.0-1.el10_1 epel 13 k hipcc x86_64 20-9.rocm7.1.1.el10 copr_base 133 k hwdata noarch 0.379-10.6.el10 baseos 1.7 M hwloc-libs x86_64 2.11.1-3.el10 baseos 2.1 M jsoncpp x86_64 1.9.5-9.el10 appstream 104 k kernel-headers x86_64 6.12.0-124.21.1.el10_1 appstream 3.2 M keyutils-libs-devel x86_64 1.6.3-5.el10 appstream 65 k krb5-devel x86_64 1.21.3-8.el10_0 appstream 145 k langpacks-core-en noarch 4.1-3.el10 appstream 12 k langpacks-fonts-en noarch 4.1-3.el10 appstream 12 k less x86_64 661-3.el10 baseos 195 k libcbor x86_64 0.11.0-3.el10 baseos 36 k libcom_err-devel x86_64 1.47.1-4.el10 appstream 17 k libdrm x86_64 2.4.123-1.el10 appstream 167 k libedit x86_64 3.1-52.20230828cvs.el10 baseos 108 k libfabric x86_64 2.1.0-1.el10 appstream 662 k libfido2 x86_64 1.14.0-7.el10 baseos 101 k libgfortran x86_64 14.3.1-2.1.el10 baseos 828 k libibverbs x86_64 57.0-2.el10 baseos 457 k libidn2-devel x86_64 2.3.7-3.el10 appstream 75 k libkadm5 x86_64 1.21.3-8.el10_0 baseos 78 k libmpc x86_64 1.3.1-7.el10 appstream 74 k libnghttp2-devel x86_64 1.64.0-2.el10 appstream 58 k libnl3 x86_64 3.11.0-1.el10 baseos 365 k libomp x86_64 20.1.8-1.el10 appstream 745 k libomp-devel x86_64 20.1.8-1.el10 appstream 283 k libpciaccess x86_64 0.16-16.el10 baseos 30 k libpipeline x86_64 1.5.7-7.el10 baseos 55 k libpsl-devel x86_64 0.21.5-6.el10 appstream 39 k libquadmath x86_64 14.3.1-2.1.el10 baseos 216 k librdmacm x86_64 57.0-2.el10 baseos 72 k libseccomp x86_64 2.5.6-1.el10 baseos 71 k libselinux-devel x86_64 3.9-1.el10 appstream 161 k libsepol-devel x86_64 3.9-1.el10 appstream 48 k libssh-devel x86_64 0.11.1-5.el10_1 appstream 42 k libstdc++-devel x86_64 14.3.1-2.1.el10 appstream 2.8 M libuv x86_64 1:1.51.0-1.el10_0 appstream 262 k libverto-devel x86_64 0.3.2-10.el10 appstream 16 k libxcrypt-devel x86_64 4.4.36-10.el10 appstream 33 k llvm-filesystem x86_64 20.1.8-1.el10 appstream 11 k llvm-libs x86_64 20.1.8-1.el10 appstream 30 M logrotate x86_64 3.22.0-4.el10 baseos 81 k make x86_64 1:4.4.1-9.el10 baseos 591 k man-db x86_64 2.12.0-10.el10 baseos 1.3 M mpdecimal x86_64 2.5.1-12.el10 baseos 92 k munge x86_64 0.5.15-10.el10 appstream 139 k munge-libs x86_64 0.5.15-10.el10 appstream 23 k ncurses x86_64 6.4-14.20240127.el10 baseos 427 k numactl-libs x86_64 2.0.19-2.el10 baseos 31 k ocl-icd x86_64 2.3.2-8.el10 baseos 69 k openssh x86_64 9.9p1-12.el10_1 baseos 351 k openssh-clients x86_64 9.9p1-12.el10_1 baseos 761 k openssl-devel x86_64 1:3.5.1-4.el10_1 appstream 4.2 M pcre2-devel x86_64 10.44-1.el10.3 appstream 536 k pcre2-utf16 x86_64 10.44-1.el10.3 appstream 228 k pcre2-utf32 x86_64 10.44-1.el10.3 appstream 216 k perl-AutoLoader noarch 5.74-512.2.el10_0 appstream 22 k perl-B x86_64 1.89-512.2.el10_0 appstream 178 k perl-Carp noarch 1.54-511.el10 appstream 31 k perl-Class-Struct noarch 0.68-512.2.el10_0 appstream 23 k perl-Data-Dumper x86_64 2.189-512.el10 appstream 60 k perl-Digest noarch 1.20-511.el10 appstream 28 k perl-Digest-MD5 x86_64 2.59-6.el10 appstream 40 k perl-DynaLoader x86_64 1.56-512.2.el10_0 appstream 27 k perl-Encode x86_64 4:3.21-511.el10 appstream 1.1 M perl-Errno x86_64 1.38-512.2.el10_0 appstream 16 k perl-Error noarch 1:0.17029-18.el10 appstream 46 k perl-Exporter noarch 5.78-511.el10 appstream 34 k perl-Fcntl x86_64 1.18-512.2.el10_0 appstream 31 k perl-File-Basename noarch 2.86-512.2.el10_0 appstream 18 k perl-File-Find noarch 1.44-512.2.el10_0 appstream 26 k perl-File-Path noarch 2.18-511.el10 appstream 37 k perl-File-Temp noarch 1:0.231.100-512.el10 appstream 63 k perl-File-stat noarch 1.14-512.2.el10_0 appstream 18 k perl-FileHandle noarch 2.05-512.2.el10_0 appstream 16 k perl-Getopt-Long noarch 1:2.58-3.el10 appstream 68 k perl-Getopt-Std noarch 1.14-512.2.el10_0 appstream 16 k perl-Git noarch 2.47.3-1.el10_0 appstream 38 k perl-HTTP-Tiny noarch 0.088-512.el10 appstream 60 k perl-IO x86_64 1.55-512.2.el10_0 appstream 81 k perl-IO-Socket-IP noarch 0.42-512.el10 appstream 45 k perl-IO-Socket-SSL noarch 2.085-3.el10 appstream 231 k perl-IPC-Open3 noarch 1.22-512.2.el10_0 appstream 23 k perl-MIME-Base64 x86_64 3.16-511.el10 appstream 34 k perl-Mozilla-CA noarch 20231213-5.el10 appstream 16 k perl-Net-SSLeay x86_64 1.94-8.el10 appstream 380 k perl-POSIX x86_64 2.20-512.2.el10_0 appstream 97 k perl-PathTools x86_64 3.91-512.el10 appstream 89 k perl-Pod-Escapes noarch 1:1.07-511.el10 appstream 22 k perl-Pod-Perldoc noarch 3.28.01-512.el10 appstream 88 k perl-Pod-Simple noarch 1:3.45-511.el10 appstream 223 k perl-Pod-Usage noarch 4:2.03-511.el10 appstream 43 k perl-Scalar-List-Utils x86_64 5:1.63-511.el10 appstream 78 k perl-SelectSaver noarch 1.02-512.2.el10_0 appstream 12 k perl-Socket x86_64 4:2.038-511.el10 appstream 59 k perl-Storable x86_64 1:3.32-511.el10 appstream 102 k perl-Symbol noarch 1.09-512.2.el10_0 appstream 15 k perl-Term-ANSIColor noarch 5.01-512.el10 appstream 51 k perl-Term-Cap noarch 1.18-511.el10 appstream 25 k perl-TermReadKey x86_64 2.38-24.el10 appstream 40 k perl-Text-ParseWords noarch 3.31-511.el10 appstream 19 k perl-Text-Tabs+Wrap noarch 2024.001-511.el10 appstream 24 k perl-Time-Local noarch 2:1.350-511.el10 appstream 38 k perl-URI noarch 5.27-3.el10 appstream 138 k perl-base noarch 2.27-512.2.el10_0 appstream 17 k perl-constant noarch 1.33-512.el10 appstream 25 k perl-if noarch 0.61.000-512.2.el10_0 appstream 15 k perl-interpreter x86_64 4:5.40.2-512.2.el10_0 appstream 73 k perl-lib x86_64 0.65-512.2.el10_0 appstream 16 k perl-libnet noarch 3.15-512.el10 appstream 131 k perl-libs x86_64 4:5.40.2-512.2.el10_0 appstream 2.4 M perl-locale noarch 1.12-512.2.el10_0 appstream 14 k perl-mro x86_64 1.29-512.2.el10_0 appstream 31 k perl-overload noarch 1.37-512.2.el10_0 appstream 46 k perl-overloading noarch 0.02-512.2.el10_0 appstream 14 k perl-parent noarch 1:0.241-512.el10 appstream 17 k perl-podlators noarch 1:5.01-511.el10 appstream 128 k perl-vars noarch 1.05-512.2.el10_0 appstream 14 k pmix x86_64 4.2.8-8.el10 appstream 746 k procps-ng x86_64 4.0.4-8.el10 baseos 374 k prrte x86_64 3.0.2-9.el10 appstream 86 k prrte-libs x86_64 3.0.2-9.el10 appstream 546 k pthreadpool x86_64 0.0^git20230829.4fe0e1e-7.el10_1 epel 48 k publicsuffix-list noarch 20240107-5.el10 appstream 90 k python3 x86_64 3.12.11-3.el10 baseos 28 k python3-libs x86_64 3.12.11-3.el10 baseos 9.4 M python3-pip-wheel noarch 23.3.2-7.el10 baseos 1.5 M redhat-mono-vf-fonts noarch 4.1.0-1.el10 baseos 346 k redhat-text-vf-fonts noarch 4.1.0-1.el10 baseos 357 k rocblas x86_64 6.4.2-7.el10_1 epel 158 M rocm-clang x86_64 20-9.rocm7.1.1.el10 copr_base 16 M rocm-clang-devel x86_64 20-9.rocm7.1.1.el10 copr_base 2.5 M rocm-clang-libs x86_64 20-9.rocm7.1.1.el10 copr_base 23 M rocm-clang-runtime-devel x86_64 20-9.rocm7.1.1.el10 copr_base 529 k rocm-comgr x86_64 20-9.rocm7.1.1.el10 copr_base 31 M rocm-device-libs x86_64 20-9.rocm7.1.1.el10 copr_base 491 k rocm-hip x86_64 6.4.2-1.el10_1 epel 9.4 M rocm-libc++ x86_64 20-9.rocm7.1.1.el10 copr_base 378 k rocm-libc++-devel x86_64 20-9.rocm7.1.1.el10 copr_base 1.2 M rocm-lld x86_64 20-9.rocm7.1.1.el10 copr_base 1.6 M rocm-llvm x86_64 20-9.rocm7.1.1.el10 copr_base 14 M rocm-llvm-devel x86_64 20-9.rocm7.1.1.el10 copr_base 4.0 M rocm-llvm-filesystem x86_64 20-9.rocm7.1.1.el10 copr_base 25 k rocm-llvm-libs x86_64 20-9.rocm7.1.1.el10 copr_base 21 M rocm-llvm-static x86_64 20-9.rocm7.1.1.el10 copr_base 31 M rocm-runtime x86_64 6.4.2-1.el10_1 epel 654 k rocsolver x86_64 6.4.2-2.el10_1 epel 118 M systemd x86_64 257-13.el10 baseos 5.7 M systemd-pam x86_64 257-13.el10 baseos 306 k systemd-rpm-macros noarch 257-13.el10 baseos 27 k tcl x86_64 1:8.6.13-4.el10 baseos 1.1 M torque-libs x86_64 6.1.3-16.el10 appstream 190 k tzdata noarch 2025c-1.el10 baseos 904 k ucx x86_64 1.18.1-1.el10 appstream 864 k vim-filesystem noarch 2:9.1.083-6.el10_1 baseos 16 k zlib-ng-compat-devel x86_64 2.2.3-2.el10 appstream 39 k Transaction Summary ============================================================================================= Install 201 Packages Total download size: 618 M Installed size: 1.7 G Downloading Packages: (1/201): hipcc-libomp-devel-20-9.rocm7.1.1.el10 143 kB/s | 15 kB 00:00 (2/201): rocm-clang-devel-20-9.rocm7.1.1.el10.x 21 MB/s | 2.5 MB 00:00 (3/201): hipcc-20-9.rocm7.1.1.el10.x86_64.rpm 344 kB/s | 133 kB 00:00 (4/201): rocm-clang-runtime-devel-20-9.rocm7.1. 1.0 MB/s | 529 kB 00:00 (5/201): rocm-clang-libs-20-9.rocm7.1.1.el10.x8 22 MB/s | 23 MB 00:01 (6/201): rocm-comgr-devel-20-9.rocm7.1.1.el10.x 728 kB/s | 33 kB 00:00 (7/201): rocm-clang-20-9.rocm7.1.1.el10.x86_64. 12 MB/s | 16 MB 00:01 (8/201): rocm-device-libs-20-9.rocm7.1.1.el10.x 8.7 MB/s | 491 kB 00:00 (9/201): rocm-libc++-20-9.rocm7.1.1.el10.x86_64 3.0 MB/s | 378 kB 00:00 (10/201): rocm-lld-20-9.rocm7.1.1.el10.x86_64.r 12 MB/s | 1.6 MB 00:00 (11/201): rocm-comgr-20-9.rocm7.1.1.el10.x86_64 36 MB/s | 31 MB 00:00 (12/201): rocm-libc++-devel-20-9.rocm7.1.1.el10 1.8 MB/s | 1.2 MB 00:00 (13/201): rocm-llvm-filesystem-20-9.rocm7.1.1.e 774 kB/s | 25 kB 00:00 (14/201): rocm-llvm-devel-20-9.rocm7.1.1.el10.x 12 MB/s | 4.0 MB 00:00 (15/201): rocm-llvm-20-9.rocm7.1.1.el10.x86_64. 15 MB/s | 14 MB 00:00 (16/201): dbus-1.14.10-5.el10.x86_64.rpm 270 kB/s | 8.5 kB 00:00 (17/201): dbus-common-1.14.10-5.el10.noarch.rpm 1.1 MB/s | 19 kB 00:00 (18/201): default-fonts-core-sans-4.1-3.el10.no 1.8 MB/s | 34 kB 00:00 (19/201): environment-modules-5.3.1-8.el10.x86_ 16 MB/s | 711 kB 00:00 (20/201): fonts-filesystem-2.0.5-18.el10.noarch 883 kB/s | 9.9 kB 00:00 (21/201): google-noto-fonts-common-20240401-5.e 835 kB/s | 19 kB 00:00 (22/201): google-noto-sans-mono-vf-fonts-202404 9.5 MB/s | 282 kB 00:00 (23/201): google-noto-sans-vf-fonts-20240401-5. 34 MB/s | 596 kB 00:00 (24/201): google-noto-serif-vf-fonts-20240401-5 26 MB/s | 648 kB 00:00 (25/201): rocm-llvm-libs-20-9.rocm7.1.1.el10.x8 30 MB/s | 21 MB 00:00 (26/201): groff-base-1.23.0-10.el10.x86_64.rpm 27 MB/s | 1.1 MB 00:00 (27/201): less-661-3.el10.x86_64.rpm 9.5 MB/s | 195 kB 00:00 (28/201): libcbor-0.11.0-3.el10.x86_64.rpm 1.8 MB/s | 36 kB 00:00 (29/201): libedit-3.1-52.20230828cvs.el10.x86_6 7.2 MB/s | 108 kB 00:00 (30/201): hwloc-libs-2.11.1-3.el10.x86_64.rpm 31 MB/s | 2.1 MB 00:00 (31/201): libfido2-1.14.0-7.el10.x86_64.rpm 4.8 MB/s | 101 kB 00:00 (32/201): libpciaccess-0.16-16.el10.x86_64.rpm 1.6 MB/s | 30 kB 00:00 (33/201): libnl3-3.11.0-1.el10.x86_64.rpm 4.5 MB/s | 365 kB 00:00 (34/201): logrotate-3.22.0-4.el10.x86_64.rpm 3.4 MB/s | 81 kB 00:00 (35/201): libpipeline-1.5.7-7.el10.x86_64.rpm 713 kB/s | 55 kB 00:00 (36/201): make-4.4.1-9.el10.x86_64.rpm 29 MB/s | 591 kB 00:00 (37/201): mpdecimal-2.5.1-12.el10.x86_64.rpm 4.5 MB/s | 92 kB 00:00 (38/201): ncurses-6.4-14.20240127.el10.x86_64.r 26 MB/s | 427 kB 00:00 (39/201): ocl-icd-2.3.2-8.el10.x86_64.rpm 3.7 MB/s | 69 kB 00:00 (40/201): python3-pip-wheel-23.3.2-7.el10.noarc 78 MB/s | 1.5 MB 00:00 (41/201): libkadm5-1.21.3-8.el10_0.x86_64.rpm 3.1 MB/s | 78 kB 00:00 (42/201): dbus-broker-36-4.el10.x86_64.rpm 1.9 MB/s | 174 kB 00:00 (43/201): tcl-8.6.13-4.el10.x86_64.rpm 8.5 MB/s | 1.1 MB 00:00 (44/201): gnutls-3.8.10-2.el10.x86_64.rpm 31 MB/s | 1.5 MB 00:00 (45/201): hwdata-0.379-10.6.el10.noarch.rpm 35 MB/s | 1.7 MB 00:00 (46/201): libibverbs-57.0-2.el10.x86_64.rpm 15 MB/s | 457 kB 00:00 (47/201): libgfortran-14.3.1-2.1.el10.x86_64.rp 24 MB/s | 828 kB 00:00 (48/201): libquadmath-14.3.1-2.1.el10.x86_64.rp 15 MB/s | 216 kB 00:00 (49/201): libseccomp-2.5.6-1.el10.x86_64.rpm 4.7 MB/s | 71 kB 00:00 (50/201): rocm-llvm-static-20-9.rocm7.1.1.el10. 27 MB/s | 31 MB 00:01 (51/201): librdmacm-57.0-2.el10.x86_64.rpm 943 kB/s | 72 kB 00:00 (52/201): numactl-libs-2.0.19-2.el10.x86_64.rpm 2.2 MB/s | 31 kB 00:00 (53/201): man-db-2.12.0-10.el10.x86_64.rpm 19 MB/s | 1.3 MB 00:00 (54/201): python3-3.12.11-3.el10.x86_64.rpm 2.1 MB/s | 28 kB 00:00 (55/201): procps-ng-4.0.4-8.el10.x86_64.rpm 6.1 MB/s | 374 kB 00:00 (56/201): python3-libs-3.12.11-3.el10.x86_64.rp 181 MB/s | 9.4 MB 00:00 (57/201): redhat-mono-vf-fonts-4.1.0-1.el10.noa 6.5 MB/s | 346 kB 00:00 (58/201): redhat-text-vf-fonts-4.1.0-1.el10.noa 16 MB/s | 357 kB 00:00 (59/201): systemd-rpm-macros-257-13.el10.noarch 2.1 MB/s | 27 kB 00:00 (60/201): systemd-257-13.el10.x86_64.rpm 203 MB/s | 5.7 MB 00:00 (61/201): systemd-pam-257-13.el10.x86_64.rpm 13 MB/s | 306 kB 00:00 (62/201): expat-2.7.1-1.el10_1.3.x86_64.rpm 8.4 MB/s | 119 kB 00:00 (63/201): openssh-clients-9.9p1-12.el10_1.x86_6 42 MB/s | 761 kB 00:00 (64/201): vim-filesystem-9.1.083-6.el10_1.noarc 319 kB/s | 16 kB 00:00 (65/201): openssh-9.9p1-12.el10_1.x86_64.rpm 7.8 MB/s | 351 kB 00:00 (66/201): tzdata-2025c-1.el10.noarch.rpm 49 MB/s | 904 kB 00:00 (67/201): libverto-devel-0.3.2-10.el10.x86_64.r 1.1 MB/s | 16 kB 00:00 (68/201): pcre2-utf32-10.44-1.el10.3.x86_64.rpm 17 MB/s | 216 kB 00:00 (69/201): perl-Data-Dumper-2.189-512.el10.x86_6 4.4 MB/s | 60 kB 00:00 (70/201): perl-Error-0.17029-18.el10.noarch.rpm 3.2 MB/s | 46 kB 00:00 (71/201): perl-Exporter-5.78-511.el10.noarch.rp 2.4 MB/s | 34 kB 00:00 (72/201): perl-HTTP-Tiny-0.088-512.el10.noarch. 4.2 MB/s | 60 kB 00:00 (73/201): brotli-1.1.0-6.el10.x86_64.rpm 365 kB/s | 22 kB 00:00 (74/201): perl-Mozilla-CA-20231213-5.el10.noarc 800 kB/s | 16 kB 00:00 (75/201): perl-Pod-Simple-3.45-511.el10.noarch. 13 MB/s | 223 kB 00:00 (76/201): perl-Scalar-List-Utils-1.63-511.el10. 6.2 MB/s | 78 kB 00:00 (77/201): perl-Term-ANSIColor-5.01-512.el10.noa 4.0 MB/s | 51 kB 00:00 (78/201): perl-Term-Cap-1.18-511.el10.noarch.rp 1.9 MB/s | 25 kB 00:00 (79/201): perl-constant-1.33-512.el10.noarch.rp 1.4 MB/s | 25 kB 00:00 (80/201): brotli-devel-1.1.0-6.el10.x86_64.rpm 2.8 MB/s | 39 kB 00:00 (81/201): wget-1.24.5-5.el10.x86_64.rpm 49 MB/s | 807 kB 00:00 (82/201): libidn2-devel-2.3.7-3.el10.x86_64.rpm 5.1 MB/s | 75 kB 00:00 (83/201): libpsl-devel-0.21.5-6.el10.x86_64.rpm 2.4 MB/s | 39 kB 00:00 (84/201): libnghttp2-devel-1.64.0-2.el10.x86_64 3.0 MB/s | 58 kB 00:00 (85/201): pcre2-utf16-10.44-1.el10.3.x86_64.rpm 16 MB/s | 228 kB 00:00 (86/201): perl-Digest-1.20-511.el10.noarch.rpm 2.3 MB/s | 28 kB 00:00 (87/201): perl-File-Temp-0.231.100-512.el10.noa 2.7 MB/s | 63 kB 00:00 (88/201): perl-Carp-1.54-511.el10.noarch.rpm 649 kB/s | 31 kB 00:00 (89/201): perl-Getopt-Long-2.58-3.el10.noarch.r 1.8 MB/s | 68 kB 00:00 (90/201): perl-IO-Socket-IP-0.42-512.el10.noarc 1.2 MB/s | 45 kB 00:00 (91/201): perl-Pod-Escapes-1.07-511.el10.noarch 1.2 MB/s | 22 kB 00:00 (92/201): openmpi-5.0.2-5.el10.x86_64.rpm 19 MB/s | 2.1 MB 00:00 (93/201): perl-MIME-Base64-3.16-511.el10.x86_64 1.1 MB/s | 34 kB 00:00 (94/201): perl-Socket-2.038-511.el10.x86_64.rpm 1.3 MB/s | 59 kB 00:00 (95/201): perl-Time-Local-1.350-511.el10.noarch 2.4 MB/s | 38 kB 00:00 (96/201): perl-Pod-Usage-2.03-511.el10.noarch.r 585 kB/s | 43 kB 00:00 (97/201): perl-TermReadKey-2.38-24.el10.x86_64. 636 kB/s | 40 kB 00:00 (98/201): perl-libnet-3.15-512.el10.noarch.rpm 9.2 MB/s | 131 kB 00:00 (99/201): keyutils-libs-devel-1.6.3-5.el10.x86_ 3.0 MB/s | 65 kB 00:00 (100/201): langpacks-en-4.1-3.el10.noarch.rpm 569 kB/s | 12 kB 00:00 (101/201): perl-Digest-MD5-2.59-6.el10.x86_64.r 1.8 MB/s | 40 kB 00:00 (102/201): perl-Encode-3.21-511.el10.x86_64.rpm 75 MB/s | 1.1 MB 00:00 (103/201): perl-PathTools-3.91-512.el10.x86_64. 6.0 MB/s | 89 kB 00:00 (104/201): perl-Storable-3.32-511.el10.x86_64.r 7.6 MB/s | 102 kB 00:00 (105/201): perl-URI-5.27-3.el10.noarch.rpm 8.9 MB/s | 138 kB 00:00 (106/201): perl-podlators-5.01-511.el10.noarch. 8.3 MB/s | 128 kB 00:00 (107/201): perl-Text-Tabs+Wrap-2024.001-511.el1 765 kB/s | 24 kB 00:00 (108/201): perl-parent-0.241-512.el10.noarch.rp 574 kB/s | 17 kB 00:00 (109/201): pmix-4.2.8-8.el10.x86_64.rpm 43 MB/s | 746 kB 00:00 (110/201): prrte-libs-3.0.2-9.el10.x86_64.rpm 28 MB/s | 546 kB 00:00 (111/201): jsoncpp-1.9.5-9.el10.x86_64.rpm 7.9 MB/s | 104 kB 00:00 (112/201): torque-libs-6.1.3-16.el10.x86_64.rpm 5.6 MB/s | 190 kB 00:00 (113/201): libdrm-2.4.123-1.el10.x86_64.rpm 11 MB/s | 167 kB 00:00 (114/201): langpacks-fonts-en-4.1-3.el10.noarch 450 kB/s | 12 kB 00:00 (115/201): munge-libs-0.5.15-10.el10.x86_64.rpm 810 kB/s | 23 kB 00:00 (116/201): pcre2-devel-10.44-1.el10.3.x86_64.rp 19 MB/s | 536 kB 00:00 (117/201): perl-File-Path-2.18-511.el10.noarch. 1.3 MB/s | 37 kB 00:00 (118/201): perl-IO-Socket-SSL-2.085-3.el10.noar 11 MB/s | 231 kB 00:00 (119/201): perl-Text-ParseWords-3.31-511.el10.n 1.2 MB/s | 19 kB 00:00 (120/201): prrte-3.0.2-9.el10.x86_64.rpm 5.3 MB/s | 86 kB 00:00 (121/201): langpacks-core-en-4.1-3.el10.noarch. 914 kB/s | 12 kB 00:00 (122/201): libxcrypt-devel-4.4.36-10.el10.x86_6 2.5 MB/s | 33 kB 00:00 (123/201): libmpc-1.3.1-7.el10.x86_64.rpm 4.9 MB/s | 74 kB 00:00 (124/201): munge-0.5.15-10.el10.x86_64.rpm 11 MB/s | 139 kB 00:00 (125/201): perl-Pod-Perldoc-3.28.01-512.el10.no 6.7 MB/s | 88 kB 00:00 (126/201): publicsuffix-list-20240107-5.el10.no 7.0 MB/s | 90 kB 00:00 (127/201): cmake-filesystem-3.30.5-3.el10_0.x86 886 kB/s | 24 kB 00:00 (128/201): cmake-data-3.30.5-3.el10_0.noarch.rp 75 MB/s | 2.5 MB 00:00 (129/201): cmake-rpm-macros-3.30.5-3.el10_0.noa 1.2 MB/s | 16 kB 00:00 (130/201): git-2.47.3-1.el10_0.x86_64.rpm 1.6 MB/s | 51 kB 00:00 (131/201): git-core-2.47.3-1.el10_0.x86_64.rpm 148 MB/s | 4.9 MB 00:00 (132/201): cmake-3.30.5-3.el10_0.x86_64.rpm 127 MB/s | 12 MB 00:00 (133/201): krb5-devel-1.21.3-8.el10_0.x86_64.rp 9.1 MB/s | 145 kB 00:00 (134/201): perl-AutoLoader-5.74-512.2.el10_0.no 1.4 MB/s | 22 kB 00:00 (135/201): perl-B-1.89-512.2.el10_0.x86_64.rpm 4.7 MB/s | 178 kB 00:00 (136/201): git-core-doc-2.47.3-1.el10_0.noarch. 48 MB/s | 3.1 MB 00:00 (137/201): perl-Class-Struct-0.68-512.2.el10_0. 507 kB/s | 23 kB 00:00 (138/201): perl-Fcntl-1.18-512.2.el10_0.x86_64. 1.0 MB/s | 31 kB 00:00 (139/201): perl-Errno-1.38-512.2.el10_0.x86_64. 296 kB/s | 16 kB 00:00 (140/201): perl-DynaLoader-1.56-512.2.el10_0.x8 467 kB/s | 27 kB 00:00 (141/201): perl-File-stat-1.14-512.2.el10_0.noa 540 kB/s | 18 kB 00:00 (142/201): perl-File-Basename-2.86-512.2.el10_0 378 kB/s | 18 kB 00:00 (143/201): perl-File-Find-1.44-512.2.el10_0.noa 432 kB/s | 26 kB 00:00 (144/201): perl-Getopt-Std-1.14-512.2.el10_0.no 1.0 MB/s | 16 kB 00:00 (145/201): perl-IO-1.55-512.2.el10_0.x86_64.rpm 6.1 MB/s | 81 kB 00:00 (146/201): perl-Git-2.47.3-1.el10_0.noarch.rpm 2.8 MB/s | 38 kB 00:00 (147/201): perl-IPC-Open3-1.22-512.2.el10_0.noa 1.6 MB/s | 23 kB 00:00 (148/201): perl-POSIX-2.20-512.2.el10_0.x86_64. 6.9 MB/s | 97 kB 00:00 (149/201): perl-FileHandle-2.05-512.2.el10_0.no 252 kB/s | 16 kB 00:00 (150/201): perl-SelectSaver-1.02-512.2.el10_0.n 1.0 MB/s | 12 kB 00:00 (151/201): perl-Symbol-1.09-512.2.el10_0.noarch 1.1 MB/s | 15 kB 00:00 (152/201): perl-base-2.27-512.2.el10_0.noarch.r 1.2 MB/s | 17 kB 00:00 (153/201): perl-interpreter-5.40.2-512.2.el10_0 5.6 MB/s | 73 kB 00:00 (154/201): perl-if-0.61.000-512.2.el10_0.noarch 910 kB/s | 15 kB 00:00 (155/201): perl-lib-0.65-512.2.el10_0.x86_64.rp 1.2 MB/s | 16 kB 00:00 (156/201): perl-libs-5.40.2-512.2.el10_0.x86_64 94 MB/s | 2.4 MB 00:00 (157/201): perl-mro-1.29-512.2.el10_0.x86_64.rp 2.0 MB/s | 31 kB 00:00 (158/201): perl-locale-1.12-512.2.el10_0.noarch 399 kB/s | 14 kB 00:00 (159/201): perl-overload-1.37-512.2.el10_0.noar 3.7 MB/s | 46 kB 00:00 (160/201): perl-overloading-0.02-512.2.el10_0.n 742 kB/s | 14 kB 00:00 (161/201): annobin-plugin-gcc-12.99-1.el10.x86_ 73 MB/s | 996 kB 00:00 (162/201): perl-vars-1.05-512.2.el10_0.noarch.r 968 kB/s | 14 kB 00:00 (163/201): emacs-filesystem-29.4-12.el10.noarch 933 kB/s | 10 kB 00:00 (164/201): clang-resource-filesystem-20.1.8-1.e 627 kB/s | 17 kB 00:00 (165/201): libcom_err-devel-1.47.1-4.el10.x86_6 1.3 MB/s | 17 kB 00:00 (166/201): cpp-14.3.1-2.1.el10.x86_64.rpm 233 MB/s | 13 MB 00:00 (167/201): libfabric-2.1.0-1.el10.x86_64.rpm 27 MB/s | 662 kB 00:00 (168/201): libuv-1.51.0-1.el10_0.x86_64.rpm 11 MB/s | 262 kB 00:00 (169/201): llvm-filesystem-20.1.8-1.el10.x86_64 593 kB/s | 11 kB 00:00 (170/201): perl-Net-SSLeay-1.94-8.el10.x86_64.r 25 MB/s | 380 kB 00:00 (171/201): gcc-14.3.1-2.1.el10.x86_64.rpm 235 MB/s | 38 MB 00:00 (172/201): zlib-ng-compat-devel-2.2.3-2.el10.x8 672 kB/s | 39 kB 00:00 (173/201): libstdc++-devel-14.3.1-2.1.el10.x86_ 23 MB/s | 2.8 MB 00:00 (174/201): annobin-docs-12.99-1.el10.noarch.rpm 4.8 MB/s | 88 kB 00:00 (175/201): gcc-plugin-annobin-14.3.1-2.1.el10.x 4.9 MB/s | 68 kB 00:00 (176/201): libcurl-devel-8.12.1-2.el10.x86_64.r 72 MB/s | 948 kB 00:00 (177/201): libomp-20.1.8-1.el10.x86_64.rpm 32 MB/s | 745 kB 00:00 (178/201): libselinux-devel-3.9-1.el10.x86_64.r 13 MB/s | 161 kB 00:00 (179/201): libsepol-devel-3.9-1.el10.x86_64.rpm 4.1 MB/s | 48 kB 00:00 (180/201): gcc-c++-14.3.1-2.1.el10.x86_64.rpm 169 MB/s | 15 MB 00:00 (181/201): libomp-devel-20.1.8-1.el10.x86_64.rp 4.6 MB/s | 283 kB 00:00 (182/201): glibc-devel-2.39-58.el10_1.2.x86_64. 42 MB/s | 602 kB 00:00 (183/201): ucx-1.18.1-1.el10.x86_64.rpm 36 MB/s | 864 kB 00:00 (184/201): xxd-9.1.083-6.el10_1.x86_64.rpm 2.3 MB/s | 31 kB 00:00 (185/201): openssl-devel-3.5.1-4.el10_1.x86_64. 185 MB/s | 4.2 MB 00:00 (186/201): kernel-headers-6.12.0-124.21.1.el10_ 124 MB/s | 3.2 MB 00:00 (187/201): libssh-devel-0.11.1-5.el10_1.x86_64. 1.4 MB/s | 42 kB 00:00 (188/201): hipblas-6.4.1-2.el10_1.x86_64.rpm 4.0 MB/s | 163 kB 00:00 (189/201): hipblas-devel-6.4.1-2.el10_1.x86_64. 4.2 MB/s | 106 kB 00:00 (190/201): hipblas-common-devel-6.4.0-1.el10_1. 293 kB/s | 13 kB 00:00 (191/201): llvm-libs-20.1.8-1.el10.x86_64.rpm 158 MB/s | 30 MB 00:00 (192/201): pthreadpool-0.0^git20230829.4fe0e1e- 1.0 MB/s | 48 kB 00:00 (193/201): pthreadpool-devel-0.0^git20230829.4f 291 kB/s | 15 kB 00:00 (194/201): rocblas-devel-6.4.2-7.el10_1.x86_64. 2.1 MB/s | 108 kB 00:00 (195/201): rocm-hip-devel-6.4.2-1.el10_1.x86_64 3.2 MB/s | 233 kB 00:00 (196/201): rocm-hip-6.4.2-1.el10_1.x86_64.rpm 70 MB/s | 9.4 MB 00:00 (197/201): rocm-rpm-macros-6.4.2-1.el10_1.noarc 596 kB/s | 16 kB 00:00 (198/201): rocm-runtime-6.4.2-1.el10_1.x86_64.r 22 MB/s | 654 kB 00:00 (199/201): rocm-runtime-devel-6.4.2-1.el10_1.x8 3.4 MB/s | 93 kB 00:00 (200/201): rocsolver-6.4.2-2.el10_1.x86_64.rpm 110 MB/s | 118 MB 00:01 (201/201): rocblas-6.4.2-7.el10_1.x86_64.rpm 109 MB/s | 158 MB 00:01 -------------------------------------------------------------------------------- Total 96 MB/s | 618 MB 00:06 Running transaction check Transaction check succeeded. Running transaction test Transaction test succeeded. Running transaction Preparing : 1/1 Installing : cmake-filesystem-3.30.5-3.el10_0.x86_64 1/201 Installing : fonts-filesystem-1:2.0.5-18.el10.noarch 2/201 Installing : expat-2.7.1-1.el10_1.3.x86_64 3/201 Installing : libmpc-1.3.1-7.el10.x86_64 4/201 Installing : munge-libs-0.5.15-10.el10.x86_64 5/201 Installing : libnl3-3.11.0-1.el10.x86_64 6/201 Installing : less-661-3.el10.x86_64 7/201 Installing : google-noto-fonts-common-20240401-5.el10.noarch 8/201 Installing : libibverbs-57.0-2.el10.x86_64 9/201 Installing : zlib-ng-compat-devel-2.2.3-2.el10.x86_64 10/201 Installing : vim-filesystem-2:9.1.083-6.el10_1.noarch 11/201 Installing : numactl-libs-2.0.19-2.el10.x86_64 12/201 Installing : make-1:4.4.1-9.el10.x86_64 13/201 Installing : libedit-3.1-52.20230828cvs.el10.x86_64 14/201 Running scriptlet: groff-base-1.23.0-10.el10.x86_64 15/201 Installing : groff-base-1.23.0-10.el10.x86_64 15/201 Running scriptlet: groff-base-1.23.0-10.el10.x86_64 15/201 Installing : rocm-llvm-filesystem-20-9.rocm7.1.1.el10.x86_64 16/201 Installing : rocm-libc++-20-9.rocm7.1.1.el10.x86_64 17/201 Running scriptlet: rocm-libc++-20-9.rocm7.1.1.el10.x86_64 17/201 Installing : rocm-llvm-libs-20-9.rocm7.1.1.el10.x86_64 18/201 Running scriptlet: rocm-llvm-libs-20-9.rocm7.1.1.el10.x86_64 18/201 Installing : rocm-clang-libs-20-9.rocm7.1.1.el10.x86_64 19/201 Running scriptlet: rocm-clang-libs-20-9.rocm7.1.1.el10.x86_64 19/201 Installing : rocm-comgr-20-9.rocm7.1.1.el10.x86_64 20/201 Running scriptlet: rocm-comgr-20-9.rocm7.1.1.el10.x86_64 20/201 Installing : rocm-lld-20-9.rocm7.1.1.el10.x86_64 21/201 Installing : rocm-libc++-devel-20-9.rocm7.1.1.el10.x86_64 22/201 Installing : librdmacm-57.0-2.el10.x86_64 23/201 Installing : libfabric-2.1.0-1.el10.x86_64 24/201 Installing : google-noto-sans-mono-vf-fonts-20240401-5.el10.n 25/201 Installing : google-noto-sans-vf-fonts-20240401-5.el10.noarch 26/201 Installing : google-noto-serif-vf-fonts-20240401-5.el10.noarc 27/201 Installing : cpp-14.3.1-2.1.el10.x86_64 28/201 Installing : redhat-mono-vf-fonts-4.1.0-1.el10.noarch 29/201 Installing : redhat-text-vf-fonts-4.1.0-1.el10.noarch 30/201 Installing : default-fonts-core-sans-4.1-3.el10.noarch 31/201 Installing : langpacks-fonts-en-4.1-3.el10.noarch 32/201 Installing : langpacks-core-en-4.1-3.el10.noarch 33/201 Installing : libssh-devel-0.11.1-5.el10_1.x86_64 34/201 Installing : hipblas-common-devel-6.4.0-1.el10_1.noarch 35/201 Installing : pthreadpool-0.0^git20230829.4fe0e1e-7.el10_1.x86 36/201 Installing : kernel-headers-6.12.0-124.21.1.el10_1.x86_64 37/201 Installing : glibc-devel-2.39-58.el10_1.2.x86_64 38/201 Installing : libxcrypt-devel-4.4.36-10.el10.x86_64 39/201 Installing : gcc-14.3.1-2.1.el10.x86_64 40/201 Running scriptlet: gcc-14.3.1-2.1.el10.x86_64 40/201 Installing : openssl-devel-1:3.5.1-4.el10_1.x86_64 41/201 Installing : ucx-1.18.1-1.el10.x86_64 42/201 Installing : libsepol-devel-3.9-1.el10.x86_64 43/201 Installing : annobin-docs-12.99-1.el10.noarch 44/201 Installing : llvm-filesystem-20.1.8-1.el10.x86_64 45/201 Installing : llvm-libs-20.1.8-1.el10.x86_64 46/201 Installing : libomp-20.1.8-1.el10.x86_64 47/201 Installing : libuv-1:1.51.0-1.el10_0.x86_64 48/201 Installing : libstdc++-devel-14.3.1-2.1.el10.x86_64 49/201 Installing : libcom_err-devel-1.47.1-4.el10.x86_64 50/201 Installing : emacs-filesystem-1:29.4-12.el10.noarch 51/201 Installing : clang-resource-filesystem-20.1.8-1.el10.x86_64 52/201 Installing : libomp-devel-20.1.8-1.el10.x86_64 53/201 Installing : publicsuffix-list-20240107-5.el10.noarch 54/201 Installing : libpsl-devel-0.21.5-6.el10.x86_64 55/201 Installing : jsoncpp-1.9.5-9.el10.x86_64 56/201 Installing : keyutils-libs-devel-1.6.3-5.el10.x86_64 57/201 Installing : pcre2-utf16-10.44-1.el10.3.x86_64 58/201 Installing : libnghttp2-devel-1.64.0-2.el10.x86_64 59/201 Installing : libidn2-devel-2.3.7-3.el10.x86_64 60/201 Installing : pcre2-utf32-10.44-1.el10.3.x86_64 61/201 Installing : pcre2-devel-10.44-1.el10.3.x86_64 62/201 Installing : libselinux-devel-3.9-1.el10.x86_64 63/201 Installing : libverto-devel-0.3.2-10.el10.x86_64 64/201 Installing : brotli-1.1.0-6.el10.x86_64 65/201 Installing : brotli-devel-1.1.0-6.el10.x86_64 66/201 Installing : tzdata-2025c-1.el10.noarch 67/201 Installing : openssh-9.9p1-12.el10_1.x86_64 68/201 Installing : procps-ng-4.0.4-8.el10.x86_64 69/201 Installing : libseccomp-2.5.6-1.el10.x86_64 70/201 Installing : libquadmath-14.3.1-2.1.el10.x86_64 71/201 Installing : libgfortran-14.3.1-2.1.el10.x86_64 72/201 Installing : hwdata-0.379-10.6.el10.noarch 73/201 Installing : libpciaccess-0.16-16.el10.x86_64 74/201 Installing : libdrm-2.4.123-1.el10.x86_64 75/201 Installing : rocm-runtime-6.4.2-1.el10_1.x86_64 76/201 Installing : rocm-runtime-devel-6.4.2-1.el10_1.x86_64 77/201 Installing : gnutls-3.8.10-2.el10.x86_64 78/201 Installing : libkadm5-1.21.3-8.el10_0.x86_64 79/201 Installing : krb5-devel-1.21.3-8.el10_0.x86_64 80/201 Installing : tcl-1:8.6.13-4.el10.x86_64 81/201 Installing : python3-pip-wheel-23.3.2-7.el10.noarch 82/201 Installing : ocl-icd-2.3.2-8.el10.x86_64 83/201 Installing : hwloc-libs-2.11.1-3.el10.x86_64 84/201 Installing : pmix-4.2.8-8.el10.x86_64 85/201 Installing : ncurses-6.4-14.20240127.el10.x86_64 86/201 Installing : perl-Digest-1.20-511.el10.noarch 87/201 Installing : perl-Digest-MD5-2.59-6.el10.x86_64 88/201 Installing : perl-B-1.89-512.2.el10_0.x86_64 89/201 Installing : perl-FileHandle-2.05-512.2.el10_0.noarch 90/201 Installing : perl-Data-Dumper-2.189-512.el10.x86_64 91/201 Installing : perl-libnet-3.15-512.el10.noarch 92/201 Installing : perl-AutoLoader-5.74-512.2.el10_0.noarch 93/201 Installing : perl-IO-Socket-IP-0.42-512.el10.noarch 94/201 Installing : perl-URI-5.27-3.el10.noarch 95/201 Installing : perl-Text-Tabs+Wrap-2024.001-511.el10.noarch 96/201 Installing : perl-Time-Local-2:1.350-511.el10.noarch 97/201 Installing : perl-Mozilla-CA-20231213-5.el10.noarch 98/201 Installing : perl-if-0.61.000-512.2.el10_0.noarch 99/201 Installing : perl-locale-1.12-512.2.el10_0.noarch 100/201 Installing : perl-Pod-Escapes-1:1.07-511.el10.noarch 101/201 Installing : perl-File-Path-2.18-511.el10.noarch 102/201 Installing : perl-IO-Socket-SSL-2.085-3.el10.noarch 103/201 Installing : perl-Net-SSLeay-1.94-8.el10.x86_64 104/201 Installing : perl-Term-ANSIColor-5.01-512.el10.noarch 105/201 Installing : perl-Class-Struct-0.68-512.2.el10_0.noarch 106/201 Installing : perl-POSIX-2.20-512.2.el10_0.x86_64 107/201 Installing : perl-IPC-Open3-1.22-512.2.el10_0.noarch 108/201 Installing : perl-Term-Cap-1.18-511.el10.noarch 109/201 Installing : perl-Pod-Simple-1:3.45-511.el10.noarch 110/201 Installing : perl-File-Temp-1:0.231.100-512.el10.noarch 111/201 Installing : perl-HTTP-Tiny-0.088-512.el10.noarch 112/201 Installing : perl-Socket-4:2.038-511.el10.x86_64 113/201 Installing : perl-SelectSaver-1.02-512.2.el10_0.noarch 114/201 Installing : perl-Symbol-1.09-512.2.el10_0.noarch 115/201 Installing : perl-File-stat-1.14-512.2.el10_0.noarch 116/201 Installing : perl-podlators-1:5.01-511.el10.noarch 117/201 Installing : perl-Pod-Perldoc-3.28.01-512.el10.noarch 118/201 Installing : perl-Text-ParseWords-3.31-511.el10.noarch 119/201 Installing : perl-Fcntl-1.18-512.2.el10_0.x86_64 120/201 Installing : perl-base-2.27-512.2.el10_0.noarch 121/201 Installing : perl-mro-1.29-512.2.el10_0.x86_64 122/201 Installing : perl-IO-1.55-512.2.el10_0.x86_64 123/201 Installing : perl-overloading-0.02-512.2.el10_0.noarch 124/201 Installing : perl-Pod-Usage-4:2.03-511.el10.noarch 125/201 Installing : perl-Scalar-List-Utils-5:1.63-511.el10.x86_64 126/201 Installing : perl-constant-1.33-512.el10.noarch 127/201 Installing : perl-MIME-Base64-3.16-511.el10.x86_64 128/201 Installing : perl-parent-1:0.241-512.el10.noarch 129/201 Installing : perl-Errno-1.38-512.2.el10_0.x86_64 130/201 Installing : perl-File-Basename-2.86-512.2.el10_0.noarch 131/201 Installing : perl-Getopt-Std-1.14-512.2.el10_0.noarch 132/201 Installing : perl-Storable-1:3.32-511.el10.x86_64 133/201 Installing : perl-overload-1.37-512.2.el10_0.noarch 134/201 Installing : perl-vars-1.05-512.2.el10_0.noarch 135/201 Installing : perl-Getopt-Long-1:2.58-3.el10.noarch 136/201 Installing : perl-Exporter-5.78-511.el10.noarch 137/201 Installing : perl-Carp-1.54-511.el10.noarch 138/201 Installing : perl-PathTools-3.91-512.el10.x86_64 139/201 Installing : perl-DynaLoader-1.56-512.2.el10_0.x86_64 140/201 Installing : perl-Encode-4:3.21-511.el10.x86_64 141/201 Installing : perl-libs-4:5.40.2-512.2.el10_0.x86_64 142/201 Installing : perl-interpreter-4:5.40.2-512.2.el10_0.x86_64 143/201 Installing : perl-Error-1:0.17029-18.el10.noarch 144/201 Installing : perl-TermReadKey-2.38-24.el10.x86_64 145/201 Installing : perl-File-Find-1.44-512.2.el10_0.noarch 146/201 Installing : perl-lib-0.65-512.2.el10_0.x86_64 147/201 Installing : mpdecimal-2.5.1-12.el10.x86_64 148/201 Installing : python3-libs-3.12.11-3.el10.x86_64 149/201 Installing : python3-3.12.11-3.el10.x86_64 150/201 Installing : cmake-rpm-macros-3.30.5-3.el10_0.noarch 151/201 Installing : cmake-data-3.30.5-3.el10_0.noarch 152/201 Installing : cmake-3.30.5-3.el10_0.x86_64 153/201 Installing : rocm-llvm-20-9.rocm7.1.1.el10.x86_64 154/201 Installing : rocm-llvm-devel-20-9.rocm7.1.1.el10.x86_64 155/201 Running scriptlet: rocm-llvm-devel-20-9.rocm7.1.1.el10.x86_64 155/201 Installing : rocm-llvm-static-20-9.rocm7.1.1.el10.x86_64 156/201 Installing : libpipeline-1.5.7-7.el10.x86_64 157/201 Running scriptlet: man-db-2.12.0-10.el10.x86_64 158/201 Installing : man-db-2.12.0-10.el10.x86_64 158/201 Running scriptlet: man-db-2.12.0-10.el10.x86_64 158/201 Installing : environment-modules-5.3.1-8.el10.x86_64 159/201 Running scriptlet: environment-modules-5.3.1-8.el10.x86_64 159/201 Installing : libcbor-0.11.0-3.el10.x86_64 160/201 Installing : libfido2-1.14.0-7.el10.x86_64 161/201 Installing : openssh-clients-9.9p1-12.el10_1.x86_64 162/201 Running scriptlet: openssh-clients-9.9p1-12.el10_1.x86_64 162/201 Installing : git-core-2.47.3-1.el10_0.x86_64 163/201 Installing : git-core-doc-2.47.3-1.el10_0.noarch 164/201 Installing : perl-Git-2.47.3-1.el10_0.noarch 165/201 Installing : git-2.47.3-1.el10_0.x86_64 166/201 Running scriptlet: dbus-common-1:1.14.10-5.el10.noarch 167/201 Creating group 'dbus' with GID 81. Creating user 'dbus' (System Message Bus) with UID 81 and GID 81. Installing : dbus-common-1:1.14.10-5.el10.noarch 167/201 Running scriptlet: dbus-common-1:1.14.10-5.el10.noarch 167/201 Running scriptlet: dbus-broker-36-4.el10.x86_64 168/201 Installing : dbus-broker-36-4.el10.x86_64 168/201 Running scriptlet: dbus-broker-36-4.el10.x86_64 168/201 Installing : dbus-1:1.14.10-5.el10.x86_64 169/201 Installing : systemd-pam-257-13.el10.x86_64 170/201 Running scriptlet: systemd-257-13.el10.x86_64 171/201 Creating group 'systemd-journal' with GID 190. Installing : systemd-257-13.el10.x86_64 171/201 Running scriptlet: systemd-257-13.el10.x86_64 171/201 Creating group 'input' with GID 104. Creating group 'kvm' with GID 36. Creating group 'render' with GID 105. Creating group 'sgx' with GID 106. Running scriptlet: logrotate-3.22.0-4.el10.x86_64 172/201 Installing : logrotate-3.22.0-4.el10.x86_64 172/201 Running scriptlet: logrotate-3.22.0-4.el10.x86_64 172/201 Created symlink '/etc/systemd/system/timers.target.wants/logrotate.timer' → '/usr/lib/systemd/system/logrotate.timer'. Running scriptlet: munge-0.5.15-10.el10.x86_64 173/201 Creating group 'munge' with GID 998. Creating user 'munge' (Runs Uid 'N' Gid Emporium) with UID 998 and GID 998. Installing : munge-0.5.15-10.el10.x86_64 173/201 Running scriptlet: munge-0.5.15-10.el10.x86_64 173/201 Installing : torque-libs-6.1.3-16.el10.x86_64 174/201 Installing : prrte-libs-3.0.2-9.el10.x86_64 175/201 Installing : prrte-3.0.2-9.el10.x86_64 176/201 Installing : rocm-clang-runtime-devel-20-9.rocm7.1.1.el10.x86 177/201 Installing : rocm-clang-20-9.rocm7.1.1.el10.x86_64 178/201 Installing : rocm-clang-devel-20-9.rocm7.1.1.el10.x86_64 179/201 Installing : rocm-device-libs-20-9.rocm7.1.1.el10.x86_64 180/201 Installing : hipcc-20-9.rocm7.1.1.el10.x86_64 181/201 Installing : rocm-hip-6.4.2-1.el10_1.x86_64 182/201 Running scriptlet: rocm-hip-6.4.2-1.el10_1.x86_64 182/201 Installing : rocblas-6.4.2-7.el10_1.x86_64 183/201 Running scriptlet: rocblas-6.4.2-7.el10_1.x86_64 183/201 Installing : rocsolver-6.4.2-2.el10_1.x86_64 184/201 Running scriptlet: rocsolver-6.4.2-2.el10_1.x86_64 184/201 Installing : hipblas-6.4.1-2.el10_1.x86_64 185/201 Running scriptlet: hipblas-6.4.1-2.el10_1.x86_64 185/201 Installing : rocm-comgr-devel-20-9.rocm7.1.1.el10.x86_64 186/201 Installing : rocm-hip-devel-6.4.2-1.el10_1.x86_64 187/201 Installing : rocblas-devel-6.4.2-7.el10_1.x86_64 188/201 Installing : hipblas-devel-6.4.1-2.el10_1.x86_64 189/201 Installing : hipcc-libomp-devel-20-9.rocm7.1.1.el10.x86_64 190/201 Installing : openmpi-2:5.0.2-5.el10.x86_64 191/201 Installing : rocm-rpm-macros-6.4.2-1.el10_1.noarch 192/201 Installing : libcurl-devel-8.12.1-2.el10.x86_64 193/201 Installing : wget-1.24.5-5.el10.x86_64 194/201 Installing : gcc-c++-14.3.1-2.1.el10.x86_64 195/201 Installing : annobin-plugin-gcc-12.99-1.el10.x86_64 196/201 Running scriptlet: annobin-plugin-gcc-12.99-1.el10.x86_64 196/201 Installing : gcc-plugin-annobin-14.3.1-2.1.el10.x86_64 197/201 Running scriptlet: gcc-plugin-annobin-14.3.1-2.1.el10.x86_64 197/201 Installing : pthreadpool-devel-0.0^git20230829.4fe0e1e-7.el10 198/201 Installing : langpacks-en-4.1-3.el10.noarch 199/201 Installing : xxd-2:9.1.083-6.el10_1.x86_64 200/201 Installing : systemd-rpm-macros-257-13.el10.noarch 201/201 Running scriptlet: systemd-rpm-macros-257-13.el10.noarch 201/201 Installed products updated. Installed: annobin-docs-12.99-1.el10.noarch annobin-plugin-gcc-12.99-1.el10.x86_64 brotli-1.1.0-6.el10.x86_64 brotli-devel-1.1.0-6.el10.x86_64 clang-resource-filesystem-20.1.8-1.el10.x86_64 cmake-3.30.5-3.el10_0.x86_64 cmake-data-3.30.5-3.el10_0.noarch cmake-filesystem-3.30.5-3.el10_0.x86_64 cmake-rpm-macros-3.30.5-3.el10_0.noarch cpp-14.3.1-2.1.el10.x86_64 dbus-1:1.14.10-5.el10.x86_64 dbus-broker-36-4.el10.x86_64 dbus-common-1:1.14.10-5.el10.noarch default-fonts-core-sans-4.1-3.el10.noarch emacs-filesystem-1:29.4-12.el10.noarch environment-modules-5.3.1-8.el10.x86_64 expat-2.7.1-1.el10_1.3.x86_64 fonts-filesystem-1:2.0.5-18.el10.noarch gcc-14.3.1-2.1.el10.x86_64 gcc-c++-14.3.1-2.1.el10.x86_64 gcc-plugin-annobin-14.3.1-2.1.el10.x86_64 git-2.47.3-1.el10_0.x86_64 git-core-2.47.3-1.el10_0.x86_64 git-core-doc-2.47.3-1.el10_0.noarch glibc-devel-2.39-58.el10_1.2.x86_64 gnutls-3.8.10-2.el10.x86_64 google-noto-fonts-common-20240401-5.el10.noarch google-noto-sans-mono-vf-fonts-20240401-5.el10.noarch google-noto-sans-vf-fonts-20240401-5.el10.noarch google-noto-serif-vf-fonts-20240401-5.el10.noarch groff-base-1.23.0-10.el10.x86_64 hipblas-6.4.1-2.el10_1.x86_64 hipblas-common-devel-6.4.0-1.el10_1.noarch hipblas-devel-6.4.1-2.el10_1.x86_64 hipcc-20-9.rocm7.1.1.el10.x86_64 hipcc-libomp-devel-20-9.rocm7.1.1.el10.x86_64 hwdata-0.379-10.6.el10.noarch hwloc-libs-2.11.1-3.el10.x86_64 jsoncpp-1.9.5-9.el10.x86_64 kernel-headers-6.12.0-124.21.1.el10_1.x86_64 keyutils-libs-devel-1.6.3-5.el10.x86_64 krb5-devel-1.21.3-8.el10_0.x86_64 langpacks-core-en-4.1-3.el10.noarch langpacks-en-4.1-3.el10.noarch langpacks-fonts-en-4.1-3.el10.noarch less-661-3.el10.x86_64 libcbor-0.11.0-3.el10.x86_64 libcom_err-devel-1.47.1-4.el10.x86_64 libcurl-devel-8.12.1-2.el10.x86_64 libdrm-2.4.123-1.el10.x86_64 libedit-3.1-52.20230828cvs.el10.x86_64 libfabric-2.1.0-1.el10.x86_64 libfido2-1.14.0-7.el10.x86_64 libgfortran-14.3.1-2.1.el10.x86_64 libibverbs-57.0-2.el10.x86_64 libidn2-devel-2.3.7-3.el10.x86_64 libkadm5-1.21.3-8.el10_0.x86_64 libmpc-1.3.1-7.el10.x86_64 libnghttp2-devel-1.64.0-2.el10.x86_64 libnl3-3.11.0-1.el10.x86_64 libomp-20.1.8-1.el10.x86_64 libomp-devel-20.1.8-1.el10.x86_64 libpciaccess-0.16-16.el10.x86_64 libpipeline-1.5.7-7.el10.x86_64 libpsl-devel-0.21.5-6.el10.x86_64 libquadmath-14.3.1-2.1.el10.x86_64 librdmacm-57.0-2.el10.x86_64 libseccomp-2.5.6-1.el10.x86_64 libselinux-devel-3.9-1.el10.x86_64 libsepol-devel-3.9-1.el10.x86_64 libssh-devel-0.11.1-5.el10_1.x86_64 libstdc++-devel-14.3.1-2.1.el10.x86_64 libuv-1:1.51.0-1.el10_0.x86_64 libverto-devel-0.3.2-10.el10.x86_64 libxcrypt-devel-4.4.36-10.el10.x86_64 llvm-filesystem-20.1.8-1.el10.x86_64 llvm-libs-20.1.8-1.el10.x86_64 logrotate-3.22.0-4.el10.x86_64 make-1:4.4.1-9.el10.x86_64 man-db-2.12.0-10.el10.x86_64 mpdecimal-2.5.1-12.el10.x86_64 munge-0.5.15-10.el10.x86_64 munge-libs-0.5.15-10.el10.x86_64 ncurses-6.4-14.20240127.el10.x86_64 numactl-libs-2.0.19-2.el10.x86_64 ocl-icd-2.3.2-8.el10.x86_64 openmpi-2:5.0.2-5.el10.x86_64 openssh-9.9p1-12.el10_1.x86_64 openssh-clients-9.9p1-12.el10_1.x86_64 openssl-devel-1:3.5.1-4.el10_1.x86_64 pcre2-devel-10.44-1.el10.3.x86_64 pcre2-utf16-10.44-1.el10.3.x86_64 pcre2-utf32-10.44-1.el10.3.x86_64 perl-AutoLoader-5.74-512.2.el10_0.noarch perl-B-1.89-512.2.el10_0.x86_64 perl-Carp-1.54-511.el10.noarch perl-Class-Struct-0.68-512.2.el10_0.noarch perl-Data-Dumper-2.189-512.el10.x86_64 perl-Digest-1.20-511.el10.noarch perl-Digest-MD5-2.59-6.el10.x86_64 perl-DynaLoader-1.56-512.2.el10_0.x86_64 perl-Encode-4:3.21-511.el10.x86_64 perl-Errno-1.38-512.2.el10_0.x86_64 perl-Error-1:0.17029-18.el10.noarch perl-Exporter-5.78-511.el10.noarch perl-Fcntl-1.18-512.2.el10_0.x86_64 perl-File-Basename-2.86-512.2.el10_0.noarch perl-File-Find-1.44-512.2.el10_0.noarch perl-File-Path-2.18-511.el10.noarch perl-File-Temp-1:0.231.100-512.el10.noarch perl-File-stat-1.14-512.2.el10_0.noarch perl-FileHandle-2.05-512.2.el10_0.noarch perl-Getopt-Long-1:2.58-3.el10.noarch perl-Getopt-Std-1.14-512.2.el10_0.noarch perl-Git-2.47.3-1.el10_0.noarch perl-HTTP-Tiny-0.088-512.el10.noarch perl-IO-1.55-512.2.el10_0.x86_64 perl-IO-Socket-IP-0.42-512.el10.noarch perl-IO-Socket-SSL-2.085-3.el10.noarch perl-IPC-Open3-1.22-512.2.el10_0.noarch perl-MIME-Base64-3.16-511.el10.x86_64 perl-Mozilla-CA-20231213-5.el10.noarch perl-Net-SSLeay-1.94-8.el10.x86_64 perl-POSIX-2.20-512.2.el10_0.x86_64 perl-PathTools-3.91-512.el10.x86_64 perl-Pod-Escapes-1:1.07-511.el10.noarch perl-Pod-Perldoc-3.28.01-512.el10.noarch perl-Pod-Simple-1:3.45-511.el10.noarch perl-Pod-Usage-4:2.03-511.el10.noarch perl-Scalar-List-Utils-5:1.63-511.el10.x86_64 perl-SelectSaver-1.02-512.2.el10_0.noarch perl-Socket-4:2.038-511.el10.x86_64 perl-Storable-1:3.32-511.el10.x86_64 perl-Symbol-1.09-512.2.el10_0.noarch perl-Term-ANSIColor-5.01-512.el10.noarch perl-Term-Cap-1.18-511.el10.noarch perl-TermReadKey-2.38-24.el10.x86_64 perl-Text-ParseWords-3.31-511.el10.noarch perl-Text-Tabs+Wrap-2024.001-511.el10.noarch perl-Time-Local-2:1.350-511.el10.noarch perl-URI-5.27-3.el10.noarch perl-base-2.27-512.2.el10_0.noarch perl-constant-1.33-512.el10.noarch perl-if-0.61.000-512.2.el10_0.noarch perl-interpreter-4:5.40.2-512.2.el10_0.x86_64 perl-lib-0.65-512.2.el10_0.x86_64 perl-libnet-3.15-512.el10.noarch perl-libs-4:5.40.2-512.2.el10_0.x86_64 perl-locale-1.12-512.2.el10_0.noarch perl-mro-1.29-512.2.el10_0.x86_64 perl-overload-1.37-512.2.el10_0.noarch perl-overloading-0.02-512.2.el10_0.noarch perl-parent-1:0.241-512.el10.noarch perl-podlators-1:5.01-511.el10.noarch perl-vars-1.05-512.2.el10_0.noarch pmix-4.2.8-8.el10.x86_64 procps-ng-4.0.4-8.el10.x86_64 prrte-3.0.2-9.el10.x86_64 prrte-libs-3.0.2-9.el10.x86_64 pthreadpool-0.0^git20230829.4fe0e1e-7.el10_1.x86_64 pthreadpool-devel-0.0^git20230829.4fe0e1e-7.el10_1.x86_64 publicsuffix-list-20240107-5.el10.noarch python3-3.12.11-3.el10.x86_64 python3-libs-3.12.11-3.el10.x86_64 python3-pip-wheel-23.3.2-7.el10.noarch redhat-mono-vf-fonts-4.1.0-1.el10.noarch redhat-text-vf-fonts-4.1.0-1.el10.noarch rocblas-6.4.2-7.el10_1.x86_64 rocblas-devel-6.4.2-7.el10_1.x86_64 rocm-clang-20-9.rocm7.1.1.el10.x86_64 rocm-clang-devel-20-9.rocm7.1.1.el10.x86_64 rocm-clang-libs-20-9.rocm7.1.1.el10.x86_64 rocm-clang-runtime-devel-20-9.rocm7.1.1.el10.x86_64 rocm-comgr-20-9.rocm7.1.1.el10.x86_64 rocm-comgr-devel-20-9.rocm7.1.1.el10.x86_64 rocm-device-libs-20-9.rocm7.1.1.el10.x86_64 rocm-hip-6.4.2-1.el10_1.x86_64 rocm-hip-devel-6.4.2-1.el10_1.x86_64 rocm-libc++-20-9.rocm7.1.1.el10.x86_64 rocm-libc++-devel-20-9.rocm7.1.1.el10.x86_64 rocm-lld-20-9.rocm7.1.1.el10.x86_64 rocm-llvm-20-9.rocm7.1.1.el10.x86_64 rocm-llvm-devel-20-9.rocm7.1.1.el10.x86_64 rocm-llvm-filesystem-20-9.rocm7.1.1.el10.x86_64 rocm-llvm-libs-20-9.rocm7.1.1.el10.x86_64 rocm-llvm-static-20-9.rocm7.1.1.el10.x86_64 rocm-rpm-macros-6.4.2-1.el10_1.noarch rocm-runtime-6.4.2-1.el10_1.x86_64 rocm-runtime-devel-6.4.2-1.el10_1.x86_64 rocsolver-6.4.2-2.el10_1.x86_64 systemd-257-13.el10.x86_64 systemd-pam-257-13.el10.x86_64 systemd-rpm-macros-257-13.el10.noarch tcl-1:8.6.13-4.el10.x86_64 torque-libs-6.1.3-16.el10.x86_64 tzdata-2025c-1.el10.noarch ucx-1.18.1-1.el10.x86_64 vim-filesystem-2:9.1.083-6.el10_1.noarch wget-1.24.5-5.el10.x86_64 xxd-2:9.1.083-6.el10_1.x86_64 zlib-ng-compat-devel-2.2.3-2.el10.x86_64 Complete! Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Wrote: /builddir/build/SRPMS/llama-cpp-b6153-1.el10.src.rpm No matches found for the following disable plugin patterns: local, spacewalk, versionlock Updating Subscription Management repositories. Unable to read consumer identity This system is not registered with an entitlement server. You can use subscription-manager to register. Copr repository 77 kB/s | 1.5 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - BaseOS 41 kB/s | 4.1 kB 00:00 Red Hat Enterprise Linux 10 for x86_64 - AppStr 43 kB/s | 4.1 kB 00:00 Red Hat CodeReady Linux Builder for RHEL 10 x86 47 kB/s | 4.0 kB 00:00 Extra Packages for Enterprise Linux 10 - x86_64 112 kB/s | 33 kB 00:00 Package cmake-3.30.5-3.el10_0.x86_64 is already installed. Package curl-8.12.1-2.el10.x86_64 is already installed. Package gcc-c++-14.3.1-2.1.el10.x86_64 is already installed. Package git-2.47.3-1.el10_0.x86_64 is already installed. Package hipblas-devel-6.4.1-2.el10_1.x86_64 is already installed. Package hipcc-libomp-devel-20-9.rocm7.1.1.el10.x86_64 is already installed. Package langpacks-en-4.1-3.el10.noarch is already installed. Package libcurl-devel-8.12.1-2.el10.x86_64 is already installed. Package openmpi-2:5.0.2-5.el10.x86_64 is already installed. Package pthreadpool-devel-0.0^git20230829.4fe0e1e-7.el10_1.x86_64 is already installed. Package rocblas-devel-6.4.2-7.el10_1.x86_64 is already installed. Package rocm-comgr-devel-20-9.rocm7.1.1.el10.x86_64 is already installed. Package rocm-hip-devel-6.4.2-1.el10_1.x86_64 is already installed. Package rocm-rpm-macros-6.4.2-1.el10_1.noarch is already installed. Package rocm-runtime-devel-6.4.2-1.el10_1.x86_64 is already installed. Package wget-1.24.5-5.el10.x86_64 is already installed. Package xxd-2:9.1.083-6.el10_1.x86_64 is already installed. Dependencies resolved. Nothing to do. Complete! Finish: build setup for llama-cpp-b6153-1.el10.src.rpm Start: rpmbuild llama-cpp-b6153-1.el10.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1766188800 Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.NI7RiK + umask 022 + cd /builddir/build/BUILD + cd /builddir/build/BUILD + rm -rf llama.cpp-b6153 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/llama.cpp-b6153.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd llama.cpp-b6153 + rm -rf /builddir/build/BUILD/llama.cpp-b6153-SPECPARTS + /usr/bin/mkdir -p /builddir/build/BUILD/llama.cpp-b6153-SPECPARTS + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' src/CMakeLists.txt + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' ggml/src/CMakeLists.txt + sed -i -e 's/POSITION_INDEPENDENT_CODE ON/POSITION_INDEPENDENT_CODE ON SOVERSION b6153/' tools/mtmd/CMakeLists.txt + sed -i '/target_link_libraries(ggml-hip PRIVATE ggml-base.*/aset_target_properties(ggml-hip PROPERTIES SOVERSION b6153)' ggml/src/ggml-hip/CMakeLists.txt + sed -i '/target_compile_features(${GGML_CPU_NAME} PRIVATE c_std_11.*/aset_target_properties(${GGML_CPU_NAME} PROPERTIES SOVERSION b6153)' ggml/src/ggml-cpu/CMakeLists.txt + sed -i '/#include ' src/llama-mmap.h + rm -rf exmples/llma.android + find . -name .gitignore -exec rm -rf '{}' ';' + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.CNUA8I + umask 022 + cd /builddir/build/BUILD + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd llama.cpp-b6153 + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON -DCMAKE_INSTALL_LIBDIR=lib64 -DCMAKE_SKIP_RPATH=ON -DGGML_AVX=OFF -DGGML_AVX2=OFF -DGGML_AVX512=OFF -DGGML_AVX512_VBMI=OFF -DGGML_AVX512_VNNI=OFF -DGGML_FMA=OFF -DGGML_F16C=OFF -DGGML_HIP=ON '-DAMDGPU_TARGETS=gfx900;gfx906:xnack-;gfx908:xnack-;gfx90a:xnack+;gfx90a:xnack-;gfx942;gfx950;gfx1010;gfx1012;gfx1030;gfx1031;gfx1035;gfx1100;gfx1101;gfx1102;gfx1103;gfx1150;gfx1151;gfx1152;gfx1153;gfx1200;gfx1201' -DLLAMA_BUILD_EXAMPLES=OFF -DLLAMA_BUILD_TESTS=OFF -- The C compiler identification is Clang 20.0.0 -- The CXX compiler identification is Clang 20.0.0 -- Detecting C compiler ABI info -- Detecting C compiler ABI info - done -- Check for working C compiler: /usr/bin/hipcc - skipped -- Detecting C compile features -- Detecting C compile features - done -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Found Git: /usr/bin/git (found version "2.47.3") fatal: not a git repository (or any of the parent directories): .git fatal: not a git repository (or any of the parent directories): .git sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- Setting GGML_NATIVE_DEFAULT to OFF -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Warning: ccache not found - consider installing it for faster compilation or disable this warning with GGML_CCACHE=OFF -- CMAKE_SYSTEM_PROCESSOR: x86_64 -- GGML_SYSTEM_ARCH: x86 -- Including CPU backend -- Could NOT find OpenMP_C (missing: OpenMP_C_FLAGS OpenMP_C_LIB_NAMES) -- Could NOT find OpenMP_CXX (missing: OpenMP_CXX_FLAGS OpenMP_CXX_LIB_NAMES) -- Could NOT find OpenMP (missing: OpenMP_C_FOUND OpenMP_CXX_FOUND) CMake Warning at ggml/src/ggml-cpu/CMakeLists.txt:80 (message): OpenMP not found Call Stack (most recent call first): ggml/src/CMakeLists.txt:372 (ggml_add_cpu_backend_variant_impl) -- x86 detected -- Adding CPU backend variant ggml-cpu: CMake Warning at ggml/src/ggml-hip/CMakeLists.txt:27 (message): Setting hipcc as the C++ compiler is legacy behavior. Prefer setting the HIP compiler directly. See README for details. CMake Warning (dev) at /usr/lib64/cmake/hip/hip-config-amd.cmake:70 (message): AMDGPU_TARGETS is deprecated. Please use GPU_TARGETS instead. Call Stack (most recent call first): /usr/lib64/cmake/hip/hip-config.cmake:159 (include) ggml/src/ggml-hip/CMakeLists.txt:39 (find_package) This warning is for project developers. Use -Wno-dev to suppress it. -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP and hipBLAS found -- Including HIP backend -- ggml version: 0.0.0 -- ggml commit: unknown CMake Warning at common/CMakeLists.txt:32 (message): Git repository not found; to enable automatic generation of build info, make sure Git is installed and the project is a Git repository. -- Found CURL: /usr/lib64/libcurl.so (found version "8.12.1") -- Configuring done (5.6s) -- Generating done (0.1s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP INCLUDE_INSTALL_DIR LIB_INSTALL_DIR LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build + /usr/bin/cmake --build redhat-linux-build -j4 --verbose Change Dir: '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j4 /usr/bin/cmake -S/builddir/build/BUILD/llama.cpp-b6153 -B/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/CMakeFiles /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-base.dir/build.make ggml/src/CMakeFiles/ggml-base.dir/depend /usr/bin/gmake -f common/CMakeFiles/build_info.dir/build.make common/CMakeFiles/build_info.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-llava-cli.dir/build.make tools/mtmd/CMakeFiles/llama-llava-cli.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build.make tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common/CMakeFiles/build_info.dir/DependInfo.cmake "--color=" cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml-base.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-llava-cli.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f common/CMakeFiles/build_info.dir/build.make common/CMakeFiles/build_info.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-llava-cli.dir/build.make tools/mtmd/CMakeFiles/llama-llava-cli.dir/build /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-base.dir/build.make ggml/src/CMakeFiles/ggml-base.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build.make tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 1%] Building CXX object tools/mtmd/CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp [ 1%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o [ 1%] Building CXX object common/CMakeFiles/build_info.dir/build-info.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml.c.o -MF CMakeFiles/ggml-base.dir/ggml.c.o.d -o CMakeFiles/ggml-base.dir/ggml.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml.c cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/build_info.dir/build-info.cpp.o -MF CMakeFiles/build_info.dir/build-info.cpp.o.d -o CMakeFiles/build_info.dir/build-info.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common/build-info.cpp [ 2%] Building CXX object tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 3%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml.cpp.o -MF CMakeFiles/ggml-base.dir/ggml.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 3%] Built target build_info [ 3%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-alloc.c.o -MF CMakeFiles/ggml-base.dir/ggml-alloc.c.o.d -o CMakeFiles/ggml-base.dir/ggml-alloc.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-alloc.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 3%] Linking CXX executable ../../bin/llama-gemma3-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-gemma3-cli.dir/link.txt --verbose=1 [ 3%] Linking CXX executable ../../bin/llama-llava-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-llava-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-gemma3-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-gemma3-cli /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-llava-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-llava-cli sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] : warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 3%] Built target llama-gemma3-cli [ 3%] Built target llama-llava-cli /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build.make tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/DependInfo.cmake "--color=" [ 3%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-backend.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-backend.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-backend.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-backend.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build.make tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 4%] Building CXX object tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build.make tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build.make tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 4%] Building CXX object tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o -MF CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o.d -o CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/deprecation-warning.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 4%] Linking CXX executable ../../bin/llama-minicpmv-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-minicpmv-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-minicpmv-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-minicpmv-cli sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 4%] Built target llama-minicpmv-cli [ 5%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-opt.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-opt.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-opt.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-opt.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 6%] Linking CXX executable ../../bin/llama-qwen2vl-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-qwen2vl-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-qwen2vl-cli.dir/deprecation-warning.cpp.o" -o ../../bin/llama-qwen2vl-cli sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 6%] Built target llama-qwen2vl-cli [ 6%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-threading.cpp.o -MF CMakeFiles/ggml-base.dir/ggml-threading.cpp.o.d -o CMakeFiles/ggml-base.dir/ggml-threading.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-threading.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 7%] Building C object ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-base.dir/ggml-quants.c.o -MF CMakeFiles/ggml-base.dir/ggml-quants.c.o.d -o CMakeFiles/ggml-base.dir/ggml-quants.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 7%] Building CXX object ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BUILD -DGGML_COMMIT=\"unknown\" -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_VERSION=\"0.0.0\" -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_base_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-base.dir/gguf.cpp.o -MF CMakeFiles/ggml-base.dir/gguf.cpp.o.d -o CMakeFiles/ggml-base.dir/gguf.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/gguf.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 8%] Linking CXX shared library ../../bin/libggml-base.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-base.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-base.so.b6153 -o ../../bin/libggml-base.so.b6153 "CMakeFiles/ggml-base.dir/ggml.c.o" "CMakeFiles/ggml-base.dir/ggml.cpp.o" "CMakeFiles/ggml-base.dir/ggml-alloc.c.o" "CMakeFiles/ggml-base.dir/ggml-backend.cpp.o" "CMakeFiles/ggml-base.dir/ggml-opt.cpp.o" "CMakeFiles/ggml-base.dir/ggml-threading.cpp.o" "CMakeFiles/ggml-base.dir/ggml-quants.c.o" "CMakeFiles/ggml-base.dir/gguf.cpp.o" -lm sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml-base.so.b6153 ../../bin/libggml-base.so.b6153 ../../bin/libggml-base.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 8%] Built target ggml-base /usr/bin/gmake -f ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build.make ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/depend /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-cpu.dir/build.make ggml/src/CMakeFiles/ggml-cpu.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml-cpu.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml-cpu.dir/build.make ggml/src/CMakeFiles/ggml-cpu.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build.make ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 8%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o [ 9%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o [ 9%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/ggml-cpu.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/ggml-cpu.c cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/repack.cpp [ 10%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 10%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1010. [ 11%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu [ 12%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/hbm.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1010. [ 12%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. [ 13%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/traits.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 13%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/amx/amx.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory 4 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1031. [ 14%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/amx/mmq.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. [ 14%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/binary-ops.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ [ 15%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/unary-ops.cpp 1 warning generated when compiling for gfx1103. sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. [ 15%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/vec.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1151. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1151. [ 16%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/ops.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1153. 4 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1200. 4 warnings generated when compiling for gfx1201. [ 16%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/llamafile/sgemm.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx1201. 4 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx906. [ 16%] Building C object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -DNDEBUG -std=gnu11 -fPIC -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -Werror=implicit-int -Werror=implicit-function-declaration -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wdouble-promotion -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/arch/x86/quants.c sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 17%] Building CXX object ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU_REPACK -DGGML_USE_LLAMAFILE -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_cpu_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o -MF CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o.d -o CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cpu/arch/x86/repack.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 1 warning generated when compiling for gfx908. 4 warnings generated when compiling for gfx90a. [ 17%] Linking CXX shared library ../../bin/libggml-cpu.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-cpu.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-cpu.so.b6153 -o ../../bin/libggml-cpu.so.b6153 "CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/ggml-cpu.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/repack.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/hbm.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/quants.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/traits.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/amx.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/amx/mmq.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/binary-ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/unary-ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/vec.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/ops.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/llamafile/sgemm.cpp.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/quants.c.o" "CMakeFiles/ggml-cpu.dir/ggml-cpu/arch/x86/repack.cpp.o" ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/acc.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx950. [ 17%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:14:44: warning: cast from 'const int *' to 'char *' drops const qualifier [-Wcast-qual] 14 | const int i11 = *(int32_t *) ((char *) src2 + i1*sizeof(int32_t) + i2*nb21); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:20:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 20 | const float * src0_row = (const float *)((char *)src0 + i1*nb01 + i2*nb02); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/add-id.cu:21:54: warning: cast from 'const float *' to 'char *' drops const qualifier [-Wcast-qual] 21 | const float * src1_row = (const float *)((char *)src1 + i11*nb11); | ^ 4 warnings generated when compiling for host. [ 18%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-cpu.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 18%] Built target ggml-cpu [ 18%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/arange.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 19%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1035. 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const 1int cc) { | ^ warning generated when compiling for gfx900. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 2 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cu:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argmax.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 19%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/argsort.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 20%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/clamp.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 20%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx1012. 2 warnings generated when compiling for gfx1012. 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx950. 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/binbcast.cu:361:11: warning: 'break' will never be executed [-Wunreachable-code-break] 361 | } break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 2 warnings generated when compiling for host. [ 21%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx908. 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv-transpose-1d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 21%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1150. 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/concat.cu:218:17: warning: 'break' will never be executed [-Wunreachable-code-break] 218 | break; | ^~~~~ 2 warnings generated when compiling for gfx1151. 2 warnings generated when compiling for host. [ 22%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-dw.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. 4 warnings generated when compiling for gfx1010. [ 22%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1152. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1012. 4 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1200. In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 2 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. 4 warnings generated when compiling for gfx1031. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1035. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 2 warnings generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1150. 4 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:17: warning: no previous prototype for function 'conv2d_transpose_kernel' [-Wmissing-prototypes] 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/conv2d-transpose.cu:6:12: note: declare 'static' if the function is not intended to be used outside of this translation unit 6 | __global__ void conv2d_transpose_kernel(const float * __restrict__ input, const half * __restrict__ kernel, | ^ | static 2 warnings generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 4 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 1 warning generated when compiling for gfx1010. 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 1 warning generated when compiling for gfx1012. 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1030. 4 warnings generated when compiling for gfx1150. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1151. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx942. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/count-equal.cu:62:13: warning: 'break' will never be executed [-Wunreachable-code-break] 62 | break; | ^~~~~ 2 warnings generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 4 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: 4 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ 1 warning generated when compiling for gfx1151. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ 4 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:504:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 504 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:44:31: warning: cast from 'const void *' to 'int *' drops const qualifier [-Wcast-qual] 44 | const int * x0 = ((int *) vx) + blockIdx.x * nint; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/convert.cu:507:9: note: in instantiation of function template specialization 'dequantize_block_q8_0_f16' requested here 507 | dequantize_block_q8_0_f16<<>>(vx, y, k); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cross-entropy-loss.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 4 warnings generated when compiling for host. [ 23%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu 1 warning generated when compiling for host. [ 24%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cpy.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 24%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 11 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 7 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 11 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 7 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/diagmask.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 7 warnings generated when compiling for gfx1031. 1 warning generated when compiling for host. [ 25%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1010. 11 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 22 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1031. 7 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 11 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t 7ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ warnings generated when compiling for gfx1101. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 1 warning generated when compiling for gfx1035. 22 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 11 warnings generated when compiling for gfx1035. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 7 warnings generated when compiling for gfx1150. 22 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1151. 11 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1153. 7 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 7 warnings generated when compiling for gfx1153. 22 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.de/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuhv:i436c:e96].: wawarning: rfunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]p _size; | 436 ^~~~~~~~~ | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, ha/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.culf:2673>:13 &: warning: B'break' will never be executed [-Wunreachable-code-break]) { | 673 ^ | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 22 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 20 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1103. 7 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 20 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 20 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B)/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 24 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx1150. 7 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 22 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:6: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn.cu:117:13: warning: 'break' will never be executed [-Wunreachable-code-break] 117 | break; | ^~~~~ 22 warnings generated when compiling for host. [ 25%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu 7 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 1 warning generated when compiling for gfx1151. 2 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 7 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 7 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx1031. 7 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 2 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1153. 7 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:32:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 32 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:542:15: warning: unused variable 'warp_size' [-Wunused-variable] 542 | const int warp_size = ggml_cuda_info().devices[ctx.device].warp_size; | ^~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:673:13: warning: 'break' will never be executed [-Wunreachable-code-break] 673 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:646:17: warning: 'break' will never be executed [-Wunreachable-code-break] 646 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:593:21: warning: 'break' will never be executed [-Wunreachable-code-break] 593 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-wmma-f16.cu:568:21: warning: 'break' will never be executed [-Wunreachable-code-break] 568 | break; | ^~~~~ 7 warnings generated when compiling for host. [ 26%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx1152. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 2 warnings generated when compiling for gfx1103. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 27 warnings generated when compiling for gfx1031. 2 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | In file included from tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1035. 2 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 27 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 2 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 1 warning generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 27 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 2 warnings generated when compiling for gfx900. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 2 warnings generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 27 warnings generated when compiling for gfx1152. 11 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 2 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 27 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 27 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 2 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/getrows.cu:231:13: warning: 'break' will never be executed [-Wunreachable-code-break] 231 | break; | ^~~~~ 1 warning generated when compiling for gfx90a. 2 warnings generated when compiling for host. [ 26%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | 11 warnings generated when compiling for gfx908. break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 1 warning generated when compiling for gfx1010. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 25 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ 1 warning generated when compiling for gfx1012. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 29 warnings generated when compiling for gfx942. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ 27 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:26: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 142 | char archName[archLen + 1]; | ^~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:142:19: note: read of non-const variable 'archLen' is not allowed in a constant expression /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:141:9: note: declared here 141 | int archLen = strlen(devName); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3422:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3422 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3415:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3415 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3412:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3412 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3403:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3403 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3396:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3396 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3391:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3391 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3386:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3386 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3381:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3381 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3334:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3334 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3326:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3326 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3322:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3322 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3303:15: warning: 'break' will never be executed [-Wunreachable-code-break] 3303 | } break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3238:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3238 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ggml-cuda.cu:3225:13: warning: 'break' will never be executed [-Wunreachable-code-break] 3225 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1031. 27 warnings generated when compiling for host. [ 27%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f32.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 11 warnings generated when compiling for gfx90a. 1 warning generated when compiling for host. [ 27%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 1 warning generated when compiling for gfx1150. 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ | ^ 3 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 11 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:23: warning: unused parameter 'ne00' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:26:83: warning: unused parameter 'ne03' [-Wunused-parameter] 26 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:23: warning: unused parameter 'ne10' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:28:83: warning: unused parameter 'ne13' [-Wunused-parameter] 28 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:43: warning: unused parameter 'nb21' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:63: warning: unused parameter 'nb22' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:30:83: warning: unused parameter 'nb23' [-Wunused-parameter] 30 | const int32_t nb21, const int32_t nb22, const int64_t nb23, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:43: warning: unused parameter 'ne31' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:31:63: warning: unused parameter 'ne32' [-Wunused-parameter] 31 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/fattn-tile-f16.cu:32:63: warning: unused parameter 'nb32' [-Wunused-parameter] 32 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1152. 11 warnings generated when compiling for host. [ 28%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1151. 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 3 warnings generated when compiling for gfx1152. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1201. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<1In file included from 6, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ _avai/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ lable(const /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuhint cc) { | ^ :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx908. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx906. 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_1 warning generated when compiling for gfx90a. SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/im2col.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 28%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mean.cu:62:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 62 | reduce_rows_f32<<>>(src0_d, dst_d, ncols); | ^ 3 warnings generated when compiling for host. [ 29%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu 13 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/gla.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1030. 1 warning generated when compiling for host. [ 29%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 13 warnings generated when compiling for gfx1031. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1035. 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1100. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1150. 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 13 warnings generated when compiling for gfx1153. 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx1201. 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 1 warning generated when compiling for gfx1012. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ 13 warnings generated when compiling for gfx950. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cu:67:13: warning: 'break' will never be executed [-Wunreachable-code-break] 67 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ 13 warnings generated when compiling for host. [ 30%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:356:26: warning: unused variable 'prec' [-Wunused-variable] 356 | const enum ggml_prec prec = fast_fp16_available(cc) ? ggml_prec(dst->op_params[0]) : GGML_PREC_F32; | ^~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmf.cu:154:32: warning: unused typedef 'tile_C' [-Wunused-local-typedef] 154 | typedef tile<16, 8, float> tile_C; | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 13 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for host. [ 30%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/norm.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 31%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/opt-step-adamw.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 31%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1010. 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 1 warning generated when compiling for gfx1012. 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/out-prod.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pad.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1010. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu::11: : In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh::13: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh::270270::4242:: warning: warning: unused parameter 'cc' [-Wunused-parameter]unused parameter 'cc' [-Wunused-parameter] 270270 | | ssttaattiicc bbooooll ffpp1166__mmmmaa__aavvaaiillaabbllee((ccoonnsstt iinntt cccc)) {{ | | ^ ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for gfx1151. 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 13 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 1 warning generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/pool2d.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^[ 32%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for gfx950. 13 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cu:188:13: warning: 'break' will never be executed [-Wunreachable-code-break] 188 | break; | ^~~~~ 13 warnings generated when compiling for host. [ 33%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/roll.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/../ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 33%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu 13 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 13 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/scale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 34%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/rope.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 34%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softcap.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 35%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/set-rows.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 35%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1035. 13 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvf.cu:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for host. [ 36%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-conv.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 36%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/softmax.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for host. [ 37%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sum.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 37%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx1012. 3 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1151. 13 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx90a. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx1031. 3 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 1 warning generated when compiling for gfx1035. 3 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1103. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 3 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1150. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cu:10: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/ssm-scan.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 38%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { 1 | ^ warning generated when compiling for gfx1010. 13 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/reduce_rows.cuh:42:21: warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] 42 | if (lane_id < (blockDim.x / WARP_SIZE)) { | ~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/sumrows.cu:10:9: note: in instantiation of function template specialization 'reduce_rows_f32' requested here 10 | reduce_rows_f32<<>>(x, dst, ncols); | ^ 3 warnings generated when compiling for host. [ 38%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1031. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/tsembd.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 39%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 1 warning generated when compiling for gfx1010. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/upscale.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 39%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/unary.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 40%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o 1 warning generated when compiling for gfx1103. cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1012. 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const iIn file included from nt cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx900. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx908. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1152. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1 warning generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/wkv.cu:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 16 warnings generated when compiling for gfx942. 1 warning generated when compiling for host. [ 40%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D,In file included from const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh half2> :356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, co&n A, const tile<16, 8, half2> & B) { | ^ s/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ t/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ tile/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ <8, 8, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhhalf2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_availab:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] l 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] e(const int cc) { | ^ 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 13 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A,In file included from const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: :463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<1In file included from 6, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __for/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhceinline__ void cp_async_wait_all() { | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32In file included from , 32/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu, int:>3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ & D, const tile<32, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh4, in:t326:90: >warning: & A, function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]const 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ tile<32, 4, int> &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh B) { :356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_allIn file included from () { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:In file included from 97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, h4a,l fi2n>t >& &B )A ,{ c | ^o nst tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh419:96::326 :warning: 90:function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419326 | | ttiillee<<1166,, 88,, filnota>t >& &D ,D ,c ocnosnts tt itlielo a&t >A ,& cAo,n scto ntsitl etf l&o aBt)> {& B| ) ^ { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh : 436 : 96 : warning: tfunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]i le<16, 4, half2> & 436D | , c o n s t t i l e l o&a tA>, &c oDn,s tc otnislte h a&l fB2)> {& A| , ^ const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhwarning: :function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]383 :97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | 383 | t i l e < 1 6t,i l8e,< 1f6l,o a8t,> h&a lDf,2 >c o&n sDt, tciolnes> && AA,, ccoonnsstt ttiillee<<186,, 88,, nhva_lbff2l>o a&t 1B6)2 >{ & | B ^) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | ti/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhle:<4801:698,: 8warning: ,function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] float> & D, const t480i | l e < 1 6 , 8 , f l otaitl>e <&1 6A,, 1c6o,n sftl otaitl>e <&8 ,D ,8 ,c ofnlsota tt>i l&e & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhwarning: :function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]436 :96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | 436 | t i l e < 1 6t,i l1e6<,1 6i,n t8>, &f lDo,a tc>o n&s tD ,t icloen 8&, Ah,a lcfo2n>s t& tAi,l ec, &8 ,B )h a{l f 2| > ^ & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544::46392::110 :warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | 463 | t i l etl o&a tD>, &c oDn,s tc otnislte , &n vA_,b fcloonastt1 6t2i>l e&< 3A2,, c4o,n sitn tt>i l&e & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh :| 419 ^: 96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1035. In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu::33: : In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::11: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh::270270::4242:: warning: warning: unused parameter 'cc' [-Wunused-parameter]unused parameter 'cc' [-Wunused-parameter] 270270 | | ssttaattiicc bbooooll ffpp1166__mmmmaa__aavvaaiillaabbllee((ccoonnsstt iinntt cccc)) {{ | | ^ ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu::33: In file included from : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuhIn file included from :/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh2:: 2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh::5151::6060:: warning: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 5151 | | ssttaattiicc ____ddeevviiccee____ ____ffoorrcceeiinnlliinnee____ vvooiidd ccpp__aassyynncc__wwaaiitt__aallll(()) {{ | | ^ ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu::33: : In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::33: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::302302::9090:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302302 | | ttiillee<<1166,, 88,, iinntt>> && DD,, ccoonnsstt ttiillee<<1166,, 44,, iinntt>> && AA,, ccoonnsstt ttiillee<<88,, 44,, iinntt>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::326326::9090:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326326 | | ttiillee<<1166,, 88,, iinntt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, iinntt>> && AA,, ccoonnsstt ttiillee<<88,, 88,, iinntt>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::356356::9696:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356356 | | ttiillee<<1166,, 44,, hhaallff22>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, hhaallff22>> && AA,, ccoonnsstt ttiillee<<88,, 88,, hhaallff22>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::383383::9797:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | 383 | t itliel2 >& &D ,D ,c ocnosnts tt itliel2 >& &A ,A ,c ocnosnts tt itliel2 >& &B )B ){ { | ^| ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::419419::9696:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419419 | | ttiillee<<1166,, 88,, ffllooaatt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, ffllooaatt>> && AA,, ccoonnsstt ttiillee<<88,, 88,, ffllooaatt>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: :warning: 436:function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]96 : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | 436 | t i l e l o&a tD>, &c oDn,s tc otnislte a l&f 2A>, &c oAn,s tc otnislte a l&f 2B>) &{ B )| ^{ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::463463::110110:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463463 | | ttiillee<<1166,, 88,, ffllooaatt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, nnvv__bbffllooaatt116622>> && AA,, ccoonnsstt ttiillee<<88,, 88,, nnvv__bbffllooaatt116622>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::480480::9898:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480480 | | ttiillee<<1166,, 1166,, ffllooaatt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, hhaallff22>> && AA,, ccoonnsstt ti lteif 2&> B&) B{) {| ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | 516t | il e < 1 6 , 1 6 , i ntti>l e&< 1D6,, c1o6n,s ti ntti>l e&< 1D6,, c8o,n sitn tt>i l&e l e&< 1A6,, c8o,n sitn tt>i l&e & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::544544::9292:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544544 | | ttiillee<<3322,, 3322,, iinntt>> && DD,, ccoonnsstt ttiillee<<3322,, 44,, iinntt>> && AA,, ccoonnsstt ttiillee<<3322,, 44,, iinntt>> && BB)) {{ | | ^ ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu::33: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::788788::4343:: warning: warning: unused parameter 'sinks_f' [-Wunused-parameter]unused parameter 'sinks_f' [-Wunused-parameter] 788788 | | ccoonnsstt ffllooaatt ** ccoonnsstt ____rreessttrriicctt____ ssiinnkkss__ff,, | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh :warning: 1257unused parameter 'KV_max' [-Wunused-parameter]: 35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | c o1257n | s t i n t *c o_n_srte sitnrti c t*_ __ _KrVe_smtarxi,c t _| _ ^ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1414 warnings generated when compiling for gfx1151. warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1100. In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu: :In file included from 3/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh: :In file included from 1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh1: :/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh270::42270:: 42warning: : warning: unused parameter 'cc' [-Wunused-parameter]unused parameter 'cc' [-Wunused-parameter] 270 | st a270t | isct abtoiocl bfopo1l6 _fmpm1a6__amvmaai_laavbaliel(acbolnes(tc oinnstt cicn)t {c c )| ^{ | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ & B) { | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ 326 | til/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhe<16:, 8,480 int:> & D98, :cons warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ t tile 516& A, :const92 til:e<8, 8, warning: int> function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]& 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ B) { /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh| ^ :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1101. In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh | ^ :419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tilIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ e<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544::326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ 92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh : 356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhti:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ le<32, 32,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ int> /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh& :436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ D, const til/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ e<32, 4, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ int/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh> :516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ & A, con/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ st tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] In file included from 1257 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 12 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1200. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for host. [ 41%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for host. [ 42%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 42%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 43%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_availab/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhle(cons:544t int c:92: c) warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] { 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1153. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & DIn file included from , const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ bool fp16_mma_available(const int cc) { /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8| ^ (((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, In file included from half/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:2> & B) { 3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh | ^ :788:43/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1201. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for host. [ 43%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_aIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ vailable(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nIn file included from v_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A14 warnings generated when compiling for gfx950. , const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 44%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx900. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | In file included from tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh16_mma:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | _availabl tile<16, 8, int> & e(coD, nst int consct tic) { le<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 44%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: 14In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 45%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for host. [ 45%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static booIn file included from l fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ , int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]:383 383 | :97 : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] tile<16383 | , 8, half2> & D, constilt tie<16le<1, 8,6, 8 , half2> & A, const tile<16, 8, half2> & B) { | ^ half2> &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half:2>419:96: warning: & function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] B) { 419 | | ^ tile<1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 6, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int14 warnings generated when compiling for gfx1152. > & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] In file included from 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ > & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | ^ | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhtile<16, 8:544,:92: intwarning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] >544 & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ | tile/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh<32, 32,:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] int356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ > & D, const tile<32, 4, int> & A, const tile<32, 4,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh int> & B) { | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx908. 14 warnings generated when compiling for host. [ 46%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. 14 warnings generated when compiling for gfx1012. 13 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:2: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/quantize.cuh:4: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/mmvq.cu:496:13: warning: 'break' will never be executed [-Wunreachable-code-break] 496 | break; | ^~~~~ 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 13 warnings generated when compiling for host. [ 46%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D,14 warnings generated when compiling for host. const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ [ 47%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 47%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu 14 warnings generated when compiling for gfx1010. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, floatIn file included from > & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ warning: unused parameter 'cc' [-Wunused-parameter] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ 270 | s/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ tatic bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | 14 tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ warnings generated when compiling for gfx1030. /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static In file included from __device__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 16/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ :419:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ 419 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ tile<16/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ , 8, floa/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ t> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, consIn file included from t tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> In file included from & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile:<516:92:16, warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]4, int> & 516 | A, const til tilee<8, 4, int> <16& B) , 1{ 6, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | : 544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] tile<16, 8, in544 | t> & D, const t tilile<32e<, 3162, , int8,> & inD, cont> st t& ileA,<32, 4, i const nt> & Atile<8,, co 8, int> & nst tile<32, B) {4, int> & B) { | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1031. 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cuconst int cc) { | ^ :3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1035. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 1414 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1101. 14 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> &In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh B) { | :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> ^ & D, const /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhtile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 326 | : 419t:96: warning: ifunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]le<1 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 6, 8, int> & D, const tile<16, 8, int> & A, const tile & B) { :436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356, | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ nv_/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhbfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 383 | t:ile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh tile:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | <1 tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 6, 16, fl/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhoat> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ :436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ 436 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KVIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ _/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhmax, | ^ :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. 1414 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1102. 14 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu::33: : In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::11: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh::270270::4242:: warning: warning: unused parameter 'cc' [-Wunused-parameter]unused parameter 'cc' [-Wunused-parameter] 270270 | | ssttaattiicc bbooooll ffpp1166__mmmmaa__aavvaaiillaabbllee((ccoonnsstt iinntt cccc)) {{ | | ^ ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) {/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: 436 :96| : ^ warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh c:o463n:s110t: twarning: ilfunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]e <8, 8, float> & B) 463{ | | ^ tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, h/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhal:f4802:>98 :& warning: B)function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] { | ^ 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh8,: 516n:v92_:b fwarning: lfunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]o at162> & A, cons t516 | t i l e < 8 , 8 , n vt_iblfel, &i nBt)> {& D| , ^ const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhfunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] :544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | 544 | t i l e < 1 6 ,t i1l6e,< 3f2l,o a3t2>, &i nDt,> c&o nDs,t ctoinlset< 1t6i,l e8<,3 2h,a l4f,2 >i n&t >A ,& cAo,n scto ntsitl ett >& &B )B ){ { | ^| ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu::33: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::788788::4343:: warning: warning: unused parameter 'sinks_f' [-Wunused-parameter]unused parameter 'sinks_f' [-Wunused-parameter] 788788 | | ccoonnsstt ffllooaatt ** ccoonnsstt ____rreessttrriicctt____ ssiinnkkss__ff,, | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh1257::125735::35 :warning: unused parameter 'KV_max' [-Wunused-parameter]warning: unused parameter 'KV_max' [-Wunused-parameter] 12571257 | | ccoonnsstt iinntt ** ____rreessttrriicctt____ KKVV__mmaaxx,, | | ^ ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 1414 warnings generated when compiling for gfx1103. warnings generated when compiling for gfx1103. 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(conIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ st int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, conIn file included from st tile/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu<16, 4, int> & A, const tile<8, :4, i3nt>: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ 90:/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1150. 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ 326 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ :419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ | tile & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 8, float> &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh D,: 419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ const tile<16, 8, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhfloat> :& A,436 cons:96:t til e<8,warning: 8, function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]float 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ > & B) { /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ 480 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ 480 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh :544 :92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ 1257 | const /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 1414 warnings generated when compiling for gfx1151. warnings generated when compiling for gfx1151. 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ 326 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ t/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ :419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ 419 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ tile<16, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ 8, flo/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ at> & D,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuhsinks_f, :1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | | ^ const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 1414 warnings generated when compiling for gfx1152. warnings generated when compiling for gfx1152. 14 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, intIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ > /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ & D,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ co/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhn:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ st til/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ e<1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ 6, 4,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ int/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ > &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh436 | : 51 : 60 : warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] tile<16, 8, f l51o | astt>a t&i cD ,_ _cdoenvsitc et_i_l e_<_1f6o,r c8e,i nhlailnfe2_>_ &v oAi,d ccopn_sats ytnicl_ew | & ^ B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx1153. 14 warnings generated when compiling for gfx906. 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:In file included from 90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60:: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ 51 | static __device/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh__ :__f356:orce96inlin: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]e__ 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ void cp_async_wait_all/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh(): { 383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, In file included from half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1200. 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx1201. 14 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx900. 14 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int>In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, iIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ nt> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx908. 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tilIn file included from e<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:2: :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | stat/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhic:419: __96: warning: devifunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ ce__ __forceinline__ void cp_a/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhsync_wait_all() {:436: 96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: In file included from function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8,In file included from int>/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu & D, const ti:3: In file included from l/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3e<16, 4, int> & : A, const tile<8, 4, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90int> & B) { | ^ : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ :326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh 326 | tile<16, 8, int> & D, const tile<16, 8, int>: & A326, c:ons90t t:ile<8 warning: , 8, infunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] t> & B) { 326 | ^ | til/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuhe:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ <16,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh :383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ 8/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh, :480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 48%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | In file included from tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: :419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:270:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ :42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | st/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuha:463tic b:110: ool warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]fp16_ 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ mma_available/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh(co:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ nst int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx90a. 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tiIn file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ l/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ e/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ 8, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 16 warnings generated when compiling for gfx942. 16 warnings generated when compiling for gfx942. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 48%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx950. 14 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu::33: : In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::11: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh::270270::4242:: warning: warning: unused parameter 'cc' [-Wunused-parameter] unused parameter 'cc' [-Wunused-parameter] 270270 | | ssttaattiicc bbooooll ffpp1166__mmmmaa__aavvaaiillaabbllee((ccoonnsstt iinntt cccc)) {{ | | ^ ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu::33: : In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::22: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh::5151::6060:: warning: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 5151 | | ssttaattiicc ____ddeevviiccee____ ____ffoorrcceeiinnlliinnee____ vvooiidd ccpp__aassyynncc__wwaaiitt__aallll(()) {{ | | ^ ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu::33: : In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::33: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::302302::9090:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302302 | | ttiillee<<1166,, 88,, iinntt>> && DD,, ccoonnsstt ttiillee<<1166,, 44,, iinntt>> && AA,, ccoonnsstt ttiillee<<88,, 44,, iinntt>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::326326::9090:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326326 | | ttiillee<<1166,, 88,, iinntt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, iinntt>> && AA,, ccoonnsstt ttiillee<<88,, 88,, iinntt>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: :warning: 356:function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]96 : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | 356 | t i l e a l&f 2D>, &c oDn,s tc otnislte a l&f 2A>, &c oAn,s tc otnislte a l&f 2B>) &{ B )| ^{ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::383383::9797:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383383 | | ttiillee<<1166,, 88,, hhaallff22>> && DD,, ccoonsnst tt itlielf &2 >A ,& Aco,n scto nstitle t<1il6,e <186,, h8a,l fh2a>l f2&> B )& B{) {| ^ | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96 :419 | warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] tile<16 ,419 | 8 , f l o a t > & Dt,i lceo8 ,& fDl,o acto>n s&t At,i lceo, &f lAo,a tc>o n&s tB )t i{l e <| 8 ^, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh::436436::9696:: warning: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436436 | | ttiillee<<1166,, 88,, ffllooaatt>> && DD,, ccoonnsstt ttiillee<<1166,, 88,, hhaallff22>> && AA,, ccoonnsstt ttiillee<<88,, 88,, hhaallff22>> && BB)) {{ | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh<1:6463,: 1108:, warning: ffunction 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]l oat> & D, const tile< 14636 | , 8 , n v _ b f l o atti1l6e2<>1 6&, A8,, cfolnosatt >t i&l eD<,8 ,c o8n,s tn vt_iblfel n&v _Bb)f l{o a t| 1 ^6 2> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh16:,480 :f98l:o awarning: t>function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] & D, const tile<16 ,480 | 8 , h a l f 2 > & At,i lceo, &h aDl,f 2c>o n&s tB )t i{l e <| 1 ^6 , 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh | : 516 : 92 : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] tile<16, 16, i516n | t > & D , c o n s tt itliel> && DA,, ccoonnsstt ttiillee<<1166,, 88,, iinntt>> && AB,) c{o n s| t ^ tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh : 544 : 92 : warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] tile<32, 32, int> 544& | D , c o n s t t i lteii n&t >A ,& cDo,n scto ntsitl et i&n tB>) &{ A ,| ^c onst tile<32, 4, int> & B) { | ^ In file included from In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu::33: : /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::788788::4343:: warning: warning: unused parameter 'sinks_f' [-Wunused-parameter]unused parameter 'sinks_f' [-Wunused-parameter] 788788 | | ccoonnsstt ffllooaatt ** ccoonnsstt ____rreessttrriicctt____ ssiinnkkss__ff,, | | ^ ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh::12571257::3535:: warning: warning: unused parameter 'KV_max' [-Wunused-parameter]unused parameter 'KV_max' [-Wunused-parameter] 12571257 | | ccoonnsstt iinntt ** ____rreessttrriicctt____ KKVV__mmaaxx,, | | ^ ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. 14 warnings generated when compiling for host. [ 49%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o [ 49%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu 14 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1153. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1012. 14 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 12 warnings generated when compiling for gfx1030. 14 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:2: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../cp-async.cuh:51:60: warning: function 'cp_async_wait_all' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 51 | static __device__ __forceinline__ void cp_async_wait_all() { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:788:43: warning: unused parameter 'sinks_f' [-Wunused-parameter] 788 | const float * const __restrict__ sinks_f, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-mma-f16.cuh:1257:35: warning: unused parameter 'KV_max' [-Wunused-parameter] 1257 | const int * __restrict__ KV_max, | ^ 14 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh: 419 :c96o:n swarning: t function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn]i nt64_t * xs = (int6 4419_ | t * ) ( ( c o n s t tiinlte <*1)6 ,x s80, +f l(otahtr>e a&d IDd,x .cxo n%s tt .tIi)l e*< 1s6t,r i8d,e f+l o2a t*> (&t hAr,e acdoIndsxt. xt i/l et<.8I,) )8;, f| l ^o at> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<e s&_ sDh,a rceodn,s ts ttrielaem<>1>6>, 8| , ^ half2> &/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh :A3660,: 13c:o nnote: sin instantiation of function template specialization 'launch_mul_mat_q' requested heret tile<8, 8, 3660h | a l f 2 > & B ) { l a| u ^n ch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3027:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 3027 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 14 warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq1_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 50%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2987:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2987 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for host. [ 51%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 51%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bflIn file included from oat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16,/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu 8, int> & A, :const tile<3: 1/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh6, 8, int> & B): { 2849:19: warning: unused variable 'nwarps' [-Wunused-variable] | ^ 2849 | constexpr int/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1153. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3011:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3011 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3019:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3019 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 52%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq3_s.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 52%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3035:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3035 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3043:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 3043 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 53%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 53%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2939:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2939 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-mxfp4.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for host. [ 54%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2899:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_DS4>' requested here 2899 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q3_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 54%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 55%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2907:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2907 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 55%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 13 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2915:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2915 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 13 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 13 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, false>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2963:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2963 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2962:54: note: in instantiation of function template specialization 'load_tiles_q4_K<128, true>' requested here 2962 | static constexpr load_tiles_mmq_t load_tiles = load_tiles_q4_K; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3578:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3578 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 10 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 17 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 56%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q4_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 56%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1152. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2923:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2923 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q2_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 57%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 15 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_1.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 57%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1031. 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ 12 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx1200. 28 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 12 warnings generated when compiling for gfx1103. 28 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:992:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 992 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_1 + k0, MMQ_MMA_TILE_X_K_Q8_1); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2971:54: note: in instantiation of function template specialization 'vec_dot_q8_1_q8_1_mma<8, 128>' requested here 2971 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_1_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1103. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ 28 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 28 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. 12 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q5_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 58%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 28 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx900. 12 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 14 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 28 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx900. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx90a. 12 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q6_k.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 58%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q4_0, GGML_TYPE_Q4_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 28 warnings generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1010. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 28 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1012. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 28 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:521:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 521 | acc[0] = __builtin_amdgcn_mfma_i32_16x16x32_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:522:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 522 | ((int64_t *) B.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:549:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 549 | acc[0] = __builtin_amdgcn_mfma_i32_32x32x16_i8(((int64_t *) A.x)[0], | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:550:69: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 550 | ((int64_t *) B.x)[0], | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:1796:31: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 1796 | for (int l = 0; l < sizeof(int); ++l) { | ~ ^ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:230:46: warning: cast from 'const int *' to 'long *' drops const qualifier [-Wcast-qual] 230 | const int64_t * xs = (int64_t *) ((const int *) xs0 + (threadIdx.x % t.I) * stride + 2 * (threadIdx.x / t.I)); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:824:13: note: in instantiation of function template specialization 'ggml_cuda_mma::load_generic<16, 8, int>' requested here 824 | load_generic(A[n], x_qs + (i0 + n*tile_A::I)*MMQ_MMA_TILE_X_K_Q8_0 + k0, MMQ_MMA_TILE_X_K_Q8_0); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2931:54: note: in instantiation of function template specialization 'vec_dot_q8_0_q8_1_mma<8, 128, MMQ_Q8_1_DS_LAYOUT_D4>' requested here 2931 | static constexpr vec_dot_mmq_t vec_dot_mma = vec_dot_q8_0_q8_1_mma; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3300:9: note: in instantiation of function template specialization 'mul_mat_q_process_tile' requested here 3300 | mul_mat_q_process_tile | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3571:13: note: in instantiation of function template specialization 'mul_mat_q' requested here 3571 | mul_mat_q<<>> | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3660:13: note: in instantiation of function template specialization 'launch_mul_mat_q' requested here 3660 | launch_mul_mat_q(ctx, args, stream); | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1035. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 1 warning generated when compiling for gfx1100. 28 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 12 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:5: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:302:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 302 | tile<16, 8, int> & D, const tile<16, 4, int> & A, const tile<8, 4, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:326:90: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 326 | tile<16, 8, int> & D, const tile<16, 8, int> & A, const tile<8, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:356:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 356 | tile<16, 4, half2> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:383:97: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 383 | tile<16, 8, half2> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:419:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 419 | tile<16, 8, float> & D, const tile<16, 8, float> & A, const tile<8, 8, float> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:436:96: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 436 | tile<16, 8, float> & D, const tile<16, 8, half2> & A, const tile<8, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:463:110: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 463 | tile<16, 8, float> & D, const tile<16, 8, nv_bfloat162> & A, const tile<8, 8, nv_bfloat162> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:480:98: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 480 | tile<16, 16, float> & D, const tile<16, 8, half2> & A, const tile<16, 8, half2> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:516:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 516 | tile<16, 16, int> & D, const tile<16, 8, int> & A, const tile<16, 8, int> & B) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mma.cuh:544:92: warning: function 'mma' could be declared with attribute 'noreturn' [-Wmissing-noreturn] 544 | tile<32, 32, int> & D, const tile<32, 4, int> & A, const tile<32, 4, int> & B) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/mmq-instance-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../mmq.cuh:2849:19: warning: unused variable 'nwarps' [-Wunused-variable] 2849 | constexpr int nwarps = mmq_get_nwarps_device(); | ^~~~~~ 12 warnings generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for host. [ 59%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1151. 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 28 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:402:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 402 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:405:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 1, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 405 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:414:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 414 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:417:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 2, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 417 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:426:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 426 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:429:13: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 4, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 429 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:437:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, false>' requested here 437 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:125:37: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 125 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:373:35: note: in instantiation of function template specialization 'flash_attn_vec_ext_f16<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 373 | fattn_kernel_t fattn_kernel = flash_attn_vec_ext_f16; | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:440:9: note: in instantiation of function template specialization 'ggml_cuda_flash_attn_ext_vec_f16_case_impl<128, 8, GGML_TYPE_Q8_0, GGML_TYPE_Q8_0, true>' requested here 440 | ggml_cuda_flash_attn_ext_vec_f16_case_impl(ctx, dst); | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:138:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 138 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:151:33: warning: comparison of integers of different signs: 'int' and 'unsigned long' [-Wsign-compare] 151 | for (int i0 = 0; i0 < D/sizeof(int); i0 += WARP_SIZE) { | ~~ ^ ~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 28 warnings generated when compiling for host. [ 60%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu 8 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1201. 8 warnings generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 60%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu 8 warnings generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1101. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1103. 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1150. 8 warnings generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1151. 1 warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1153. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for host. [ 61%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne3In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx1200. 8 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 8 warnings generated when compiling for host. [ 61%] Building CXX object ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/hipcc -DGGML_BACKEND_BUILD -DGGML_BACKEND_SHARED -DGGML_HIP_NO_VMM -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_HIP -DUSE_PROF_API=1 -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -D__HIP_PLATFORM_AMD__=1 -Dggml_hip_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-hip/.. -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -x hip --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT ggml/src/ggml-hip/CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o -MF CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o.d -o CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1103. 1 warning generated when compiling for gfx1010. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1012. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. 8 warnings generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1031. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1035. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx1035. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx908. 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1103. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1150. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for gfx950. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu:3: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:23: warning: unused parameter 'ne00' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:29:83: warning: unused parameter 'ne03' [-Wunused-parameter] 29 | const int32_t ne00, const int32_t ne01, const int32_t ne02, const int32_t ne03, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:23: warning: unused parameter 'ne10' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:31:83: warning: unused parameter 'ne13' [-Wunused-parameter] 31 | const int32_t ne10, const int32_t ne11, const int32_t ne12, const int32_t ne13, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:43: warning: unused parameter 'ne31' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:34:63: warning: unused parameter 'ne32' [-Wunused-parameter] 34 | const int32_t ne31, const int32_t ne32, const int32_t ne33, | ^ /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f16.cuh:35:63: warning: unused parameter 'nb32' [-Wunused-parameter] 35 | const int32_t nb31, const int32_t nb32, const int64_t nb33) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 8 warnings generated when compiling for host. 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1151. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1152. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1153. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1200. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx1201. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx900. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for gfx950. In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu:3: In file included from /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../fattn-vec-f32.cuh:1: /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-cuda/template-instances/../common.cuh:270:42: warning: unused parameter 'cc' [-Wunused-parameter] 270 | static bool fp16_mma_available(const int cc) { | ^ 1 warning generated when compiling for host. [ 62%] Linking CXX shared library ../../../bin/libggml-hip.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml-hip.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml-hip.so.b6153 -o ../../../bin/libggml-hip.so.b6153 "CMakeFiles/ggml-hip.dir/__/ggml-cuda/acc.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/add-id.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/arange.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/argmax.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/argsort.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/binbcast.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/clamp.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/concat.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv-transpose-1d.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-dw.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/conv2d-transpose.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/convert.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/count-equal.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/cpy.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/cross-entropy-loss.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/diagmask.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-tile-f32.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn-wmma-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/fattn.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/getrows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ggml-cuda.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/gla.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/im2col.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mean.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmf.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmq.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvf.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/mmvq.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/norm.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/opt-step-adamw.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/out-prod.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/pad.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/pool2d.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/quantize.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/roll.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/rope.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/scale.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/set-rows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/softcap.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/softmax.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-conv.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/ssm-scan.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/sum.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/sumrows.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/tsembd.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/unary.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/upscale.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/wkv.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_1-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_16-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_2-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_32-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_4-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_64-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_2.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-mma-f16-instance-ncols1_8-ncols2_8.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq1_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq2_xxs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_s.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq3_xxs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_nl.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-iq4_xs.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-mxfp4.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q2_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q3_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q4_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_1.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q5_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q6_k.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/mmq-instance-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q4_0-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q4_0-q4_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-q8_0-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-q8_0-q8_0.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs128-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs256-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f16-instance-hs64-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs128-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs256-f16-f16.cu.o" "CMakeFiles/ggml-hip.dir/__/ggml-cuda/template-instances/fattn-vec-f32-instance-hs64-f16-f16.cu.o" ../../../bin/libggml-base.so.b6153 /usr/lib64/libhipblas.so.2.4 --hip-link --offload-arch=gfx900 --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx90a:xnack- --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1010 --offload-arch=gfx1012 --offload-arch=gfx1030 --offload-arch=gfx1031 --offload-arch=gfx1035 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1103 --offload-arch=gfx1150 --offload-arch=gfx1151 --offload-arch=gfx1152 --offload-arch=gfx1153 --offload-arch=gfx1200 --offload-arch=gfx1201 /usr/lib64/librocblas.so.4.4 /usr/lib64/libamdhip64.so.6.4.43484 clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/ggml-hip && /usr/bin/cmake -E cmake_symlink_library ../../../bin/libggml-hip.so.b6153 ../../../bin/libggml-hip.so.b6153 ../../../bin/libggml-hip.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 62%] Built target ggml-hip /usr/bin/gmake -f ggml/src/CMakeFiles/ggml.dir/build.make ggml/src/CMakeFiles/ggml.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src/CMakeFiles/ggml.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f ggml/src/CMakeFiles/ggml.dir/build.make ggml/src/CMakeFiles/ggml.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 62%] Building CXX object ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_BUILD -DGGML_SCHED_MAX_COPIES=4 -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -D_GNU_SOURCE -D_XOPEN_SOURCE=600 -Dggml_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -std=gnu++17 -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT ggml/src/CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o -MF CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o.d -o CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/ggml/src/ggml-backend-reg.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 63%] Linking CXX shared library ../../bin/libggml.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/ggml.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libggml.so.b6153 -o ../../bin/libggml.so.b6153 "CMakeFiles/ggml.dir/ggml-backend-reg.cpp.o" -ldl ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/ggml/src && /usr/bin/cmake -E cmake_symlink_library ../../bin/libggml.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 63%] Built target ggml /usr/bin/gmake -f src/CMakeFiles/llama.dir/build.make src/CMakeFiles/llama.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src/CMakeFiles/llama.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f src/CMakeFiles/llama.dir/build.make src/CMakeFiles/llama.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 64%] Building CXX object src/CMakeFiles/llama.dir/llama-adapter.cpp.o [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-batch.cpp.o [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-arch.cpp.o [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-batch.cpp.o -MF CMakeFiles/llama.dir/llama-batch.cpp.o.d -o CMakeFiles/llama.dir/llama-batch.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-batch.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-arch.cpp.o -MF CMakeFiles/llama.dir/llama-arch.cpp.o.d -o CMakeFiles/llama.dir/llama-arch.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-arch.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama.cpp.o -MF CMakeFiles/llama.dir/llama.cpp.o.d -o CMakeFiles/llama.dir/llama.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-adapter.cpp.o -MF CMakeFiles/llama.dir/llama-adapter.cpp.o.d -o CMakeFiles/llama.dir/llama-adapter.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-adapter.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 65%] Building CXX object src/CMakeFiles/llama.dir/llama-chat.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-chat.cpp.o -MF CMakeFiles/llama.dir/llama-chat.cpp.o.d -o CMakeFiles/llama.dir/llama-chat.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-chat.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 66%] Building CXX object src/CMakeFiles/llama.dir/llama-context.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-context.cpp.o -MF CMakeFiles/llama.dir/llama-context.cpp.o.d -o CMakeFiles/llama.dir/llama-context.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-context.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 66%] Building CXX object src/CMakeFiles/llama.dir/llama-cparams.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-cparams.cpp.o -MF CMakeFiles/llama.dir/llama-cparams.cpp.o.d -o CMakeFiles/llama.dir/llama-cparams.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-cparams.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 67%] Building CXX object src/CMakeFiles/llama.dir/llama-grammar.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-grammar.cpp.o -MF CMakeFiles/llama.dir/llama-grammar.cpp.o.d -o CMakeFiles/llama.dir/llama-grammar.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-grammar.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 67%] Building CXX object src/CMakeFiles/llama.dir/llama-graph.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-graph.cpp.o -MF CMakeFiles/llama.dir/llama-graph.cpp.o.d -o CMakeFiles/llama.dir/llama-graph.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-graph.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 68%] Building CXX object src/CMakeFiles/llama.dir/llama-hparams.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-hparams.cpp.o -MF CMakeFiles/llama.dir/llama-hparams.cpp.o.d -o CMakeFiles/llama.dir/llama-hparams.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-hparams.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 68%] Building CXX object src/CMakeFiles/llama.dir/llama-impl.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-impl.cpp.o -MF CMakeFiles/llama.dir/llama-impl.cpp.o.d -o CMakeFiles/llama.dir/llama-impl.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-impl.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-io.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-io.cpp.o -MF CMakeFiles/llama.dir/llama-io.cpp.o.d -o CMakeFiles/llama.dir/llama-io.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-io.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o -MF CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o.d -o CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-kv-cache-unified.cpp [ 69%] Building CXX object src/CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o -MF CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o.d -o CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-kv-cache-unified-iswa.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 70%] Building CXX object src/CMakeFiles/llama.dir/llama-memory.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory.cpp.o -MF CMakeFiles/llama.dir/llama-memory.cpp.o.d -o CMakeFiles/llama.dir/llama-memory.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-memory.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 70%] Building CXX object src/CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o -MF CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o.d -o CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-memory-hybrid.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 71%] Building CXX object src/CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o -MF CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o.d -o CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-memory-recurrent.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 71%] Building CXX object src/CMakeFiles/llama.dir/llama-mmap.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-mmap.cpp.o -MF CMakeFiles/llama.dir/llama-mmap.cpp.o.d -o CMakeFiles/llama.dir/llama-mmap.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-mmap.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 72%] Building CXX object src/CMakeFiles/llama.dir/llama-model-loader.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model-loader.cpp.o -MF CMakeFiles/llama.dir/llama-model-loader.cpp.o.d -o CMakeFiles/llama.dir/llama-model-loader.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-model-loader.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 72%] Building CXX object src/CMakeFiles/llama.dir/llama-model-saver.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model-saver.cpp.o -MF CMakeFiles/llama.dir/llama-model-saver.cpp.o.d -o CMakeFiles/llama.dir/llama-model-saver.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-model-saver.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 73%] Building CXX object src/CMakeFiles/llama.dir/llama-model.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-model.cpp.o -MF CMakeFiles/llama.dir/llama-model.cpp.o.d -o CMakeFiles/llama.dir/llama-model.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-model.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 73%] Building CXX object src/CMakeFiles/llama.dir/llama-quant.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-quant.cpp.o -MF CMakeFiles/llama.dir/llama-quant.cpp.o.d -o CMakeFiles/llama.dir/llama-quant.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-quant.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 74%] Building CXX object src/CMakeFiles/llama.dir/llama-sampling.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-sampling.cpp.o -MF CMakeFiles/llama.dir/llama-sampling.cpp.o.d -o CMakeFiles/llama.dir/llama-sampling.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-sampling.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 74%] Building CXX object src/CMakeFiles/llama.dir/llama-vocab.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/llama-vocab.cpp.o -MF CMakeFiles/llama.dir/llama-vocab.cpp.o.d -o CMakeFiles/llama.dir/llama-vocab.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/llama-vocab.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 75%] Building CXX object src/CMakeFiles/llama.dir/unicode-data.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/unicode-data.cpp.o -MF CMakeFiles/llama.dir/unicode-data.cpp.o.d -o CMakeFiles/llama.dir/unicode-data.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/unicode-data.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 75%] Building CXX object src/CMakeFiles/llama.dir/unicode.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dllama_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/src/. -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT src/CMakeFiles/llama.dir/unicode.cpp.o -MF CMakeFiles/llama.dir/unicode.cpp.o.d -o CMakeFiles/llama.dir/unicode.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/src/unicode.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 76%] Linking CXX shared library ../bin/libllama.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libllama.so.b6153 -o ../bin/libllama.so.b6153 CMakeFiles/llama.dir/llama.cpp.o "CMakeFiles/llama.dir/llama-adapter.cpp.o" "CMakeFiles/llama.dir/llama-arch.cpp.o" "CMakeFiles/llama.dir/llama-batch.cpp.o" "CMakeFiles/llama.dir/llama-chat.cpp.o" "CMakeFiles/llama.dir/llama-context.cpp.o" "CMakeFiles/llama.dir/llama-cparams.cpp.o" "CMakeFiles/llama.dir/llama-grammar.cpp.o" "CMakeFiles/llama.dir/llama-graph.cpp.o" "CMakeFiles/llama.dir/llama-hparams.cpp.o" "CMakeFiles/llama.dir/llama-impl.cpp.o" "CMakeFiles/llama.dir/llama-io.cpp.o" "CMakeFiles/llama.dir/llama-kv-cache-unified.cpp.o" "CMakeFiles/llama.dir/llama-kv-cache-unified-iswa.cpp.o" "CMakeFiles/llama.dir/llama-memory.cpp.o" "CMakeFiles/llama.dir/llama-memory-hybrid.cpp.o" "CMakeFiles/llama.dir/llama-memory-recurrent.cpp.o" "CMakeFiles/llama.dir/llama-mmap.cpp.o" "CMakeFiles/llama.dir/llama-model-loader.cpp.o" "CMakeFiles/llama.dir/llama-model-saver.cpp.o" "CMakeFiles/llama.dir/llama-model.cpp.o" "CMakeFiles/llama.dir/llama-quant.cpp.o" "CMakeFiles/llama.dir/llama-sampling.cpp.o" "CMakeFiles/llama.dir/llama-vocab.cpp.o" "CMakeFiles/llama.dir/unicode-data.cpp.o" CMakeFiles/llama.dir/unicode.cpp.o ../bin/libggml.so.b6153 ../bin/libggml-cpu.so.b6153 ../bin/libggml-hip.so.b6153 ../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/src && /usr/bin/cmake -E cmake_symlink_library ../bin/libllama.so.b6153 ../bin/libllama.so.b6153 ../bin/libllama.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 76%] Built target llama /usr/bin/gmake -f common/CMakeFiles/common.dir/build.make common/CMakeFiles/common.dir/depend /usr/bin/gmake -f tools/mtmd/CMakeFiles/mtmd.dir/build.make tools/mtmd/CMakeFiles/mtmd.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/mtmd.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common/CMakeFiles/common.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/mtmd.dir/build.make tools/mtmd/CMakeFiles/mtmd.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f common/CMakeFiles/common.dir/build.make common/CMakeFiles/common.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 77%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd-audio.cpp.o [ 77%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd.cpp.o [ 78%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/clip.cpp.o [ 78%] Building CXX object common/CMakeFiles/common.dir/arg.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd.cpp.o -MF CMakeFiles/mtmd.dir/mtmd.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/arg.cpp.o -MF CMakeFiles/common.dir/arg.cpp.o.d -o CMakeFiles/common.dir/arg.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/arg.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd-audio.cpp.o -MF CMakeFiles/mtmd.dir/mtmd-audio.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd-audio.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd-audio.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/clip.cpp.o -MF CMakeFiles/mtmd.dir/clip.cpp.o.d -o CMakeFiles/mtmd.dir/clip.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/clip.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object common/CMakeFiles/common.dir/chat-parser.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/chat-parser.cpp.o -MF CMakeFiles/common.dir/chat-parser.cpp.o.d -o CMakeFiles/common.dir/chat-parser.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/chat-parser.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object common/CMakeFiles/common.dir/chat.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/chat.cpp.o -MF CMakeFiles/common.dir/chat.cpp.o.d -o CMakeFiles/common.dir/chat.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/chat.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 79%] Building CXX object tools/mtmd/CMakeFiles/mtmd.dir/mtmd-helper.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_BUILD -DLLAMA_SHARED -Dmtmd_EXPORTS -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../.. -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/../../vendor -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -Wno-cast-qual -MD -MT tools/mtmd/CMakeFiles/mtmd.dir/mtmd-helper.cpp.o -MF CMakeFiles/mtmd.dir/mtmd-helper.cpp.o.d -o CMakeFiles/mtmd.dir/mtmd-helper.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd-helper.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 80%] Building CXX object common/CMakeFiles/common.dir/common.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/common.cpp.o -MF CMakeFiles/common.dir/common.cpp.o.d -o CMakeFiles/common.dir/common.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/common.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 81%] Linking CXX shared library ../../bin/libmtmd.so cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/mtmd.dir/link.txt --verbose=1 /usr/bin/hipcc -fPIC -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,libmtmd.so.b6153 -o ../../bin/libmtmd.so.b6153 CMakeFiles/mtmd.dir/mtmd.cpp.o "CMakeFiles/mtmd.dir/mtmd-audio.cpp.o" CMakeFiles/mtmd.dir/clip.cpp.o "CMakeFiles/mtmd.dir/mtmd-helper.cpp.o" ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 81%] Building CXX object common/CMakeFiles/common.dir/console.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/console.cpp.o -MF CMakeFiles/common.dir/console.cpp.o.d -o CMakeFiles/common.dir/console.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/console.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 82%] Building CXX object common/CMakeFiles/common.dir/json-partial.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/json-partial.cpp.o -MF CMakeFiles/common.dir/json-partial.cpp.o.d -o CMakeFiles/common.dir/json-partial.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/json-partial.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 82%] Building CXX object common/CMakeFiles/common.dir/json-schema-to-grammar.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/json-schema-to-grammar.cpp.o -MF CMakeFiles/common.dir/json-schema-to-grammar.cpp.o.d -o CMakeFiles/common.dir/json-schema-to-grammar.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/json-schema-to-grammar.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_symlink_library ../../bin/libmtmd.so.b6153 ../../bin/libmtmd.so.b6153 ../../bin/libmtmd.so gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 82%] Built target mtmd [ 83%] Building CXX object common/CMakeFiles/common.dir/llguidance.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/llguidance.cpp.o -MF CMakeFiles/common.dir/llguidance.cpp.o.d -o CMakeFiles/common.dir/llguidance.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/llguidance.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 83%] Building CXX object common/CMakeFiles/common.dir/log.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/log.cpp.o -MF CMakeFiles/common.dir/log.cpp.o.d -o CMakeFiles/common.dir/log.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/log.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 84%] Building CXX object common/CMakeFiles/common.dir/ngram-cache.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/ngram-cache.cpp.o -MF CMakeFiles/common.dir/ngram-cache.cpp.o.d -o CMakeFiles/common.dir/ngram-cache.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/ngram-cache.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 84%] Building CXX object common/CMakeFiles/common.dir/regex-partial.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/regex-partial.cpp.o -MF CMakeFiles/common.dir/regex-partial.cpp.o.d -o CMakeFiles/common.dir/regex-partial.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/regex-partial.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 85%] Building CXX object common/CMakeFiles/common.dir/sampling.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/sampling.cpp.o -MF CMakeFiles/common.dir/sampling.cpp.o.d -o CMakeFiles/common.dir/sampling.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/sampling.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 85%] Building CXX object common/CMakeFiles/common.dir/speculative.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -fPIC -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT common/CMakeFiles/common.dir/speculative.cpp.o -MF CMakeFiles/common.dir/speculative.cpp.o.d -o CMakeFiles/common.dir/speculative.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/common/speculative.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 86%] Linking CXX static library libcommon.a cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/cmake -P CMakeFiles/common.dir/cmake_clean_target.cmake cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/common && /usr/bin/cmake -E cmake_link_script CMakeFiles/common.dir/link.txt --verbose=1 /usr/bin/ar qc libcommon.a CMakeFiles/common.dir/arg.cpp.o "CMakeFiles/common.dir/chat-parser.cpp.o" CMakeFiles/common.dir/chat.cpp.o CMakeFiles/common.dir/common.cpp.o CMakeFiles/common.dir/console.cpp.o "CMakeFiles/common.dir/json-partial.cpp.o" "CMakeFiles/common.dir/json-schema-to-grammar.cpp.o" CMakeFiles/common.dir/llguidance.cpp.o CMakeFiles/common.dir/log.cpp.o "CMakeFiles/common.dir/ngram-cache.cpp.o" "CMakeFiles/common.dir/regex-partial.cpp.o" CMakeFiles/common.dir/sampling.cpp.o CMakeFiles/common.dir/speculative.cpp.o "CMakeFiles/build_info.dir/build-info.cpp.o" bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record /usr/bin/ranlib libcommon.a bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record bfd plugin: LLVM gold plugin has failed to create LTO module: Invalid record gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 86%] Built target common /usr/bin/gmake -f tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build.make tools/batched-bench/CMakeFiles/llama-batched-bench.dir/depend /usr/bin/gmake -f tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build.make tools/gguf-split/CMakeFiles/llama-gguf-split.dir/depend /usr/bin/gmake -f tools/imatrix/CMakeFiles/llama-imatrix.dir/build.make tools/imatrix/CMakeFiles/llama-imatrix.dir/depend /usr/bin/gmake -f tools/llama-bench/CMakeFiles/llama-bench.dir/build.make tools/llama-bench/CMakeFiles/llama-bench.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/batched-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench/CMakeFiles/llama-batched-bench.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/gguf-split /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split/CMakeFiles/llama-gguf-split.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/imatrix /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix/CMakeFiles/llama-imatrix.dir/DependInfo.cmake "--color=" gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/llama-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench/CMakeFiles/llama-bench.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build.make tools/batched-bench/CMakeFiles/llama-batched-bench.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build.make tools/gguf-split/CMakeFiles/llama-gguf-split.dir/build gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/llama-bench/CMakeFiles/llama-bench.dir/build.make tools/llama-bench/CMakeFiles/llama-bench.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/imatrix/CMakeFiles/llama-imatrix.dir/build.make tools/imatrix/CMakeFiles/llama-imatrix.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 87%] Building CXX object tools/gguf-split/CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o [ 87%] Building CXX object tools/batched-bench/CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/batched-bench/CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o -MF CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o.d -o CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/batched-bench/batched-bench.cpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/gguf-split/CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o -MF CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o.d -o CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/gguf-split/gguf-split.cpp [ 87%] Building CXX object tools/llama-bench/CMakeFiles/llama-bench.dir/llama-bench.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/llama-bench/CMakeFiles/llama-bench.dir/llama-bench.cpp.o -MF CMakeFiles/llama-bench.dir/llama-bench.cpp.o.d -o CMakeFiles/llama-bench.dir/llama-bench.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/llama-bench/llama-bench.cpp [ 88%] Building CXX object tools/imatrix/CMakeFiles/llama-imatrix.dir/imatrix.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/imatrix/CMakeFiles/llama-imatrix.dir/imatrix.cpp.o -MF CMakeFiles/llama-imatrix.dir/imatrix.cpp.o.d -o CMakeFiles/llama-imatrix.dir/imatrix.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/imatrix/imatrix.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 88%] Linking CXX executable ../../bin/llama-gguf-split cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/gguf-split && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-gguf-split.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-gguf-split.dir/gguf-split.cpp.o" -o ../../bin/llama-gguf-split ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 89%] Linking CXX executable ../../bin/llama-batched-bench cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/batched-bench && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-batched-bench.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-batched-bench.dir/batched-bench.cpp.o" -o ../../bin/llama-batched-bench ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 89%] Built target llama-gguf-split /usr/bin/gmake -f tools/main/CMakeFiles/llama-cli.dir/build.make tools/main/CMakeFiles/llama-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/main /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main/CMakeFiles/llama-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/main/CMakeFiles/llama-cli.dir/build.make tools/main/CMakeFiles/llama-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 89%] Building CXX object tools/main/CMakeFiles/llama-cli.dir/main.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/main/CMakeFiles/llama-cli.dir/main.cpp.o -MF CMakeFiles/llama-cli.dir/main.cpp.o.d -o CMakeFiles/llama-cli.dir/main.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/main/main.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 89%] Linking CXX executable ../../bin/llama-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/main && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-cli.dir/main.cpp.o" -o ../../bin/llama-cli ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 89%] Linking CXX executable ../../bin/llama-imatrix cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/imatrix && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-imatrix.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-imatrix.dir/imatrix.cpp.o" -o ../../bin/llama-imatrix ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 90%] Linking CXX executable ../../bin/llama-bench cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/llama-bench && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-bench.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-bench.dir/llama-bench.cpp.o" -o ../../bin/llama-bench ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 90%] Built target llama-bench /usr/bin/gmake -f tools/perplexity/CMakeFiles/llama-perplexity.dir/build.make tools/perplexity/CMakeFiles/llama-perplexity.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/perplexity /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity/CMakeFiles/llama-perplexity.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/perplexity/CMakeFiles/llama-perplexity.dir/build.make tools/perplexity/CMakeFiles/llama-perplexity.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Building CXX object tools/perplexity/CMakeFiles/llama-perplexity.dir/perplexity.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/perplexity/CMakeFiles/llama-perplexity.dir/perplexity.cpp.o -MF CMakeFiles/llama-perplexity.dir/perplexity.cpp.o.d -o CMakeFiles/llama-perplexity.dir/perplexity.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/perplexity/perplexity.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 91%] Linking CXX executable ../../bin/llama-perplexity cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/perplexity && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-perplexity.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-perplexity.dir/perplexity.cpp.o" -o ../../bin/llama-perplexity ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Built target llama-batched-bench /usr/bin/gmake -f tools/quantize/CMakeFiles/llama-quantize.dir/build.make tools/quantize/CMakeFiles/llama-quantize.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/quantize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize/CMakeFiles/llama-quantize.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/quantize/CMakeFiles/llama-quantize.dir/build.make tools/quantize/CMakeFiles/llama-quantize.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Building CXX object tools/quantize/CMakeFiles/llama-quantize.dir/quantize.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/tools/quantize/../../common -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/quantize/CMakeFiles/llama-quantize.dir/quantize.cpp.o -MF CMakeFiles/llama-quantize.dir/quantize.cpp.o.d -o CMakeFiles/llama-quantize.dir/quantize.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/quantize/quantize.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Built target llama-cli /usr/bin/gmake -f tools/server/CMakeFiles/llama-server.dir/build.make tools/server/CMakeFiles/llama-server.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 91%] Generating loading.html.hpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -DINPUT=/builddir/build/BUILD/llama.cpp-b6153/tools/server/public/loading.html -DOUTPUT=/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server/loading.html.hpp -P /builddir/build/BUILD/llama.cpp-b6153/scripts/xxd.cmake [ 92%] Generating index.html.gz.hpp cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -DINPUT=/builddir/build/BUILD/llama.cpp-b6153/tools/server/public/index.html.gz -DOUTPUT=/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server/index.html.gz.hpp -P /builddir/build/BUILD/llama.cpp-b6153/scripts/xxd.cmake [ 93%] Linking CXX executable ../../bin/llama-quantize cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/quantize && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-quantize.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-quantize.dir/quantize.cpp.o" -o ../../bin/llama-quantize ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 93%] Built target llama-quantize /usr/bin/gmake -f tools/run/CMakeFiles/llama-run.dir/build.make tools/run/CMakeFiles/llama-run.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/run /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run/CMakeFiles/llama-run.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/run/CMakeFiles/llama-run.dir/build.make tools/run/CMakeFiles/llama-run.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 93%] Building CXX object tools/run/CMakeFiles/llama-run.dir/run.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/run/CMakeFiles/llama-run.dir/run.cpp.o -MF CMakeFiles/llama-run.dir/run.cpp.o.d -o CMakeFiles/llama-run.dir/run.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/run/run.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/server /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server/CMakeFiles/llama-server.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/server/CMakeFiles/llama-server.dir/build.make tools/server/CMakeFiles/llama-server.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 94%] Building CXX object tools/server/CMakeFiles/llama-server.dir/server.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/tools/server -I/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server -I/builddir/build/BUILD/llama.cpp-b6153/tools/server/../llava -I/builddir/build/BUILD/llama.cpp-b6153 -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/server/CMakeFiles/llama-server.dir/server.cpp.o -MF CMakeFiles/llama-server.dir/server.cpp.o.d -o CMakeFiles/llama-server.dir/server.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/server/server.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 94%] Built target llama-imatrix [ 95%] Building CXX object tools/run/CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/run/CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o -MF CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o.d -o CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/run/linenoise.cpp/linenoise.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory /usr/bin/gmake -f tools/tokenize/CMakeFiles/llama-tokenize.dir/build.make tools/tokenize/CMakeFiles/llama-tokenize.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/tokenize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize/CMakeFiles/llama-tokenize.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/tokenize/CMakeFiles/llama-tokenize.dir/build.make tools/tokenize/CMakeFiles/llama-tokenize.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 96%] Building CXX object tools/tokenize/CMakeFiles/llama-tokenize.dir/tokenize.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/tokenize/CMakeFiles/llama-tokenize.dir/tokenize.cpp.o -MF CMakeFiles/llama-tokenize.dir/tokenize.cpp.o.d -o CMakeFiles/llama-tokenize.dir/tokenize.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/tokenize/tokenize.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 96%] Linking CXX executable ../../bin/llama-tokenize cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tokenize && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-tokenize.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-tokenize.dir/tokenize.cpp.o" -o ../../bin/llama-tokenize ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 96%] Built target llama-tokenize /usr/bin/gmake -f tools/tts/CMakeFiles/llama-tts.dir/build.make tools/tts/CMakeFiles/llama-tts.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/tts /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts/CMakeFiles/llama-tts.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/tts/CMakeFiles/llama-tts.dir/build.make tools/tts/CMakeFiles/llama-tts.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 97%] Building CXX object tools/tts/CMakeFiles/llama-tts.dir/tts.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/tts/CMakeFiles/llama-tts.dir/tts.cpp.o -MF CMakeFiles/llama-tts.dir/tts.cpp.o.d -o CMakeFiles/llama-tts.dir/tts.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/tts/tts.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 97%] Linking CXX executable ../../bin/llama-run cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/run && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-run.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-run.dir/run.cpp.o" "CMakeFiles/llama-run.dir/linenoise.cpp/linenoise.cpp.o" -o ../../bin/llama-run ../../common/libcommon.a ../../bin/libllama.so.b6153 /usr/lib64/libcurl.so ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 97%] Built target llama-perplexity /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build.make tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build.make tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 98%] Building CXX object tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/. -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/mtmd/CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o -MF CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o.d -o CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/mtmd/mtmd-cli.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 98%] Linking CXX executable ../../bin/llama-mtmd-cli cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/mtmd && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-mtmd-cli.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-mtmd-cli.dir/mtmd-cli.cpp.o" -o ../../bin/llama-mtmd-cli ../../common/libcommon.a ../../bin/libmtmd.so.b6153 /usr/lib64/libcurl.so ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [ 98%] Linking CXX executable ../../bin/llama-tts cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/tts && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-tts.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-tts.dir/tts.cpp.o" -o ../../bin/llama-tts ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 98%] Built target llama-run /usr/bin/gmake -f tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build.make tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/cvector-generator /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build.make tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 99%] Building CXX object tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/cvector-generator/CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o -MF CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o.d -o CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/cvector-generator/cvector-generator.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory [ 99%] Linking CXX executable ../../bin/llama-cvector-generator cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/cvector-generator && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-cvector-generator.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-cvector-generator.dir/cvector-generator.cpp.o" -o ../../bin/llama-cvector-generator ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [ 99%] Built target llama-mtmd-cli /usr/bin/gmake -f tools/export-lora/CMakeFiles/llama-export-lora.dir/build.make tools/export-lora/CMakeFiles/llama-export-lora.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/llama.cpp-b6153 /builddir/build/BUILD/llama.cpp-b6153/tools/export-lora /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora/CMakeFiles/llama-export-lora.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/gmake -f tools/export-lora/CMakeFiles/llama-export-lora.dir/build.make tools/export-lora/CMakeFiles/llama-export-lora.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Building CXX object tools/export-lora/CMakeFiles/llama-export-lora.dir/export-lora.cpp.o cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora && /usr/bin/hipcc -DGGML_BACKEND_SHARED -DGGML_SHARED -DGGML_USE_CPU -DGGML_USE_CUDA -DGGML_USE_HIP -DLLAMA_SHARED -DLLAMA_USE_CURL -I/builddir/build/BUILD/llama.cpp-b6153/common/. -I/builddir/build/BUILD/llama.cpp-b6153/common/../vendor -I/builddir/build/BUILD/llama.cpp-b6153/src/../include -I/builddir/build/BUILD/llama.cpp-b6153/ggml/src/../include -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wmissing-declarations -Wmissing-noreturn -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -MD -MT tools/export-lora/CMakeFiles/llama-export-lora.dir/export-lora.cpp.o -MF CMakeFiles/llama-export-lora.dir/export-lora.cpp.o.d -o CMakeFiles/llama-export-lora.dir/export-lora.cpp.o -c /builddir/build/BUILD/llama.cpp-b6153/tools/export-lora/export-lora.cpp sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-tts [100%] Linking CXX executable ../../bin/llama-export-lora cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/export-lora && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-export-lora.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-export-lora.dir/export-lora.cpp.o" -o ../../bin/llama-export-lora ../../common/libcommon.a ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 /usr/lib64/libcurl.so sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] [100%] Linking CXX executable ../../bin/llama-server cd /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/tools/server && /usr/bin/cmake -E cmake_link_script CMakeFiles/llama-server.dir/link.txt --verbose=1 /usr/bin/hipcc -O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -DNDEBUG -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes "CMakeFiles/llama-server.dir/server.cpp.o" -o ../../bin/llama-server ../../common/libcommon.a ../../bin/libmtmd.so.b6153 /usr/lib64/libcurl.so ../../bin/libllama.so.b6153 ../../bin/libggml.so.b6153 ../../bin/libggml-cpu.so.b6153 ../../bin/libggml-hip.so.b6153 ../../bin/libggml-base.so.b6153 sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-cvector-generator gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-export-lora gmake[2]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' [100%] Built target llama-server gmake[1]: Leaving directory '/builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/llama.cpp-b6153/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.byXQea + umask 022 + cd /builddir/build/BUILD + '[' /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 '!=' / ']' + rm -rf /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 ++ dirname /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + mkdir -p /builddir/build/BUILDROOT + mkdir /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + CFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection ' + export CFLAGS + CXXFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection' + export CXXFLAGS + FFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -flto=thin -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64-v3 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd llama.cpp-b6153 + DESTDIR=/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "Release" -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-cpu.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-cpu.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-hip.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-hip.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cpu.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-alloc.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-backend.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-blas.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cann.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cpp.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-cuda.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-opt.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-metal.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-rpc.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-sycl.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-vulkan.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/ggml-webgpu.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/gguf.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-base.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml-base.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/ggml/ggml-config.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/ggml/ggml-version.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-batched-bench -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-gguf-split -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-imatrix -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-bench -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-cli -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-perplexity -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-quantize -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-server -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-run -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-tokenize -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-tts -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libmtmd.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libmtmd.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/mtmd.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/mtmd-helper.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-mtmd-cli -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-cvector-generator -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/llama-export-lora -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libllama.so.b6153 -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libllama.so -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/llama.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/include/llama-cpp.h -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/llama/llama-config.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/cmake/llama/llama-version.cmake -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/convert_hf_to_gguf.py -- Installing: /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/pkgconfig/llama.pc + rm -rf '/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/lib64/libggml_shared.*' + rm /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/bin/convert_hf_to_gguf.py + /usr/bin/find-debuginfo -j4 --strict-build-id -m -i --build-id-seed b6153-1.el10 --unique-debug-suffix -b6153-1.el10.x86_64 --unique-debug-src-base llama-cpp-b6153-1.el10.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 -S debugsourcefiles.list /builddir/build/BUILD/llama.cpp-b6153 find-debuginfo: starting Extracting debug info from 20 files DWARF-compressing 20 files dwz: ./usr/bin/llama-batched-bench-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-bench-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-cli-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-cvector-generator-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-export-lora-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-gguf-split-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-imatrix-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-mtmd-cli-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-perplexity-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-quantize-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-run-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-server-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-tokenize-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/bin/llama-tts-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-base.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-cpu.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml-hip.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libggml.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libllama.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: ./usr/lib64/libmtmd.so.b6153-b6153-1.el10.x86_64.debug: Unknown debugging section .debug_str_offsets dwz: Too few files for multifile optimization sepdebugcrcfix: Updated 0 CRC32s, 20 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/llama-cpp-b6153-1.el10.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + /usr/lib/rpm/redhat/brp-python-rpm-in-distinfo + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j4 + /usr/lib/rpm/redhat/brp-python-hardlink Processing files: llama-cpp-b6153-1.el10.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.Vfsq5M + umask 022 + cd /builddir/build/BUILD + cd llama.cpp-b6153 + LICENSEDIR=/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/licenses/llama-cpp + export LC_ALL= + LC_ALL= + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/licenses/llama-cpp + cp -pr /builddir/build/BUILD/llama.cpp-b6153/LICENSE /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/licenses/llama-cpp + RPM_EC=0 ++ jobs -p + exit 0 Provides: libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libllama.so.b6153()(64bit) libmtmd.so.b6153()(64bit) llama-cpp = b6153-1.el10 llama-cpp(x86-64) = b6153-1.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.6()(64bit) libamdhip64.so.6(hip_4.2)(64bit) libamdhip64.so.6(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.29)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.32)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.7)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libcurl.so.4()(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libhipblas.so.2()(64bit) libllama.so.b6153()(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.2.5)(64bit) libm.so.6(GLIBC_2.27)(64bit) libm.so.6(GLIBC_2.29)(64bit) libmtmd.so.b6153()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.11)(64bit) libstdc++.so.6(CXXABI_1.3.13)(64bit) libstdc++.so.6(CXXABI_1.3.2)(64bit) libstdc++.so.6(CXXABI_1.3.3)(64bit) libstdc++.so.6(CXXABI_1.3.5)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(CXXABI_1.3.9)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.14)(64bit) libstdc++.so.6(GLIBCXX_3.4.15)(64bit) libstdc++.so.6(GLIBCXX_3.4.17)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.20)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.25)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) Recommends: numactl Processing files: llama-cpp-devel-b6153-1.el10.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.0yckKc + umask 022 + cd /builddir/build/BUILD + cd llama.cpp-b6153 + DOCDIR=/builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/doc/llama-cpp-devel + export LC_ALL= + LC_ALL= + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/doc/llama-cpp-devel + cp -pr /builddir/build/BUILD/llama.cpp-b6153/README.md /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64/usr/share/doc/llama-cpp-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(ggml) cmake(llama) llama-cpp-devel = b6153-1.el10 llama-cpp-devel(x86-64) = b6153-1.el10 pkgconfig(llama) = 0.0.0 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: /usr/bin/pkg-config cmake-filesystem(x86-64) libggml-base.so.b6153()(64bit) libggml-cpu.so.b6153()(64bit) libggml-hip.so.b6153()(64bit) libggml.so.b6153()(64bit) libllama.so.b6153()(64bit) libmtmd.so.b6153()(64bit) Processing files: llama-cpp-debugsource-b6153-1.el10.x86_64 Provides: llama-cpp-debugsource = b6153-1.el10 llama-cpp-debugsource(x86-64) = b6153-1.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: llama-cpp-debuginfo-b6153-1.el10.x86_64 Provides: debuginfo(build-id) = 030f675679b5628c33237c57f8e94207ae988ddd debuginfo(build-id) = 1777f49d5d007f443bcd0a67aa0aeb2a0704cd2d debuginfo(build-id) = 2058754c5baaf238b6d233094a50da7559f48136 debuginfo(build-id) = 2443a81f7d9127c2b9a58c01523d1215b3f03558 debuginfo(build-id) = 2b29ded490bda5acb0819c4218bdd892e63c1628 debuginfo(build-id) = 36ec3840d5ddd547330bae4625de5f5e53608c5d debuginfo(build-id) = 3a96483cb98abf7b9f34457a18dc9659fb2a5c7e debuginfo(build-id) = 3e2f94c7104877bbaa3b38ec255c8c4e797566f9 debuginfo(build-id) = 557ad9c5c5789fb7a68019f5d211c2d8a9b970b1 debuginfo(build-id) = 5db3f104af0a56ee48e76e48a68748273c524010 debuginfo(build-id) = 5f0a18d9f10ef889d56d66c0752bf043d92aca10 debuginfo(build-id) = 6aaeea64b3404a2c89aba59d512062e495077b78 debuginfo(build-id) = 7ceea55fcfc4178644214dac8be6d91bdbd7ac75 debuginfo(build-id) = 9063fb3cf7a78c44d6f158d0005bcff8c73e90bb debuginfo(build-id) = 9ab69f4ee629e2f7ec5fcdb76f81134132dd36b5 debuginfo(build-id) = c0eb247a78f98a8a868bfdbbfefb6a3675feb7d6 debuginfo(build-id) = c39c0e17ddae5069be2b0f887683325cef09e432 debuginfo(build-id) = e1c20bbd0e76d2622e2dc9a0f5d61f84c3c8e8a3 debuginfo(build-id) = f2317aad9a292a0cd5d82db6da0c4fc8aab8ebdb debuginfo(build-id) = f2b8e7f28a8c5123a727ac4f86af50ba6bdce9a5 libggml-base.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libggml-cpu.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libggml-hip.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libggml.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libllama.so.b6153-b6153-1.el10.x86_64.debug()(64bit) libmtmd.so.b6153-b6153-1.el10.x86_64.debug()(64bit) llama-cpp-debuginfo = b6153-1.el10 llama-cpp-debuginfo(x86-64) = b6153-1.el10 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: llama-cpp-debugsource(x86-64) = b6153-1.el10 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 Wrote: /builddir/build/RPMS/llama-cpp-devel-b6153-1.el10.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-debugsource-b6153-1.el10.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-debuginfo-b6153-1.el10.x86_64.rpm Wrote: /builddir/build/RPMS/llama-cpp-b6153-1.el10.x86_64.rpm Executing(%clean): /bin/sh -e /var/tmp/rpm-tmp.fRoKvB + umask 022 + cd /builddir/build/BUILD + cd llama.cpp-b6153 + /usr/bin/rm -rf /builddir/build/BUILDROOT/llama-cpp-b6153-1.el10.x86_64 + RPM_EC=0 ++ jobs -p + exit 0 Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.JBG26p + umask 022 + cd /builddir/build/BUILD + rm -rf /builddir/build/BUILD/llama.cpp-b6153-SPECPARTS + rm -rf llama.cpp-b6153 llama.cpp-b6153.gemspec + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild llama-cpp-b6153-1.el10.src.rpm Finish: build phase for llama-cpp-b6153-1.el10.src.rpm INFO: chroot_scan: 3 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root/var/log/dnf.rpm.log /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root/var/log/dnf.librepo.log /var/lib/mock/rhel+epel-10-x86_64-1766268696.705688/root/var/log/dnf.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/llama-cpp-b6153-1.el10.src.rpm) Config(child) 89 minutes 24 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "llama-cpp-debuginfo", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" }, { "name": "llama-cpp-debugsource", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" }, { "name": "llama-cpp-devel", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" }, { "name": "llama-cpp", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "src" }, { "name": "llama-cpp", "epoch": null, "version": "b6153", "release": "1.el10", "arch": "x86_64" } ] } RPMResults finished